ABSTRACT
Participatory sensing has emerged as a new data collection paradigm, in which humans use their own devices (cell phone accelerometers, cameras, etc.) as sensors. This paradigm enables to collect a huge amount of data from the crowd for world-wide applications, without spending cost to buy dedicated sensors. Despite of this benefit, the data collected from human sensors are inherently uncertain due to no quality guarantee from the participants. Moreover, the participatory sensing data are time series that not only exhibit highly irregular dependencies on time, but also vary from sensor to sensor. To overcome these issues, we study in this paper the problem of reconciling probabilistic data from given (uncertain) time series collected by participatory sensors. More precisely, an iterative process is executed in which we exchange between two mutual reinforcing routines: (i) aggregating probabilistic time series from multiple sensors and expert input, (ii) validating them by expert knowledge with minimal effort. Through extensive experimentation, we demonstrate the efficiency and effectiveness of our approach on both real data and synthetic data.
- Reynold Cheng, Dmitri V Kalashnikov, and Sunil Prabhakar. 2003. Evaluating probabilistic queries over imprecise data. In SIGMOD. 551--562. Google ScholarDigital Library
- Graham Cormode and Minos Garofalakis. 2007. Sketching probabilistic data streams. In SIGMOD. 281--292. Google ScholarDigital Library
- Nilesh Dalvi and Dan Suciu. 2007. Efficient query evaluation on probabilistic databases. JVLDB (2007), 523--544. Google ScholarDigital Library
- Charles Miller Grinstead and James Laurie Snell. 1998. Introduction to probability. American Mathematical Soc.Google Scholar
- Ming Hua, Jian Pei, Wenjie Zhang, and Xuemin Lin. 2008. Ranking queries on uncertain data: a probabilistic threshold approach. In SIGMOD. 673--686. Google ScholarDigital Library
- Nguyen Quoc Viet Hung, Nguyen Thanh Tam, Lam Ngoc Tran, and Karl Aberer. 2013. An Evaluation of Aggregation Techniques in Crowdsourcing. In WISE. 1--15.Google Scholar
- Shawn R. Jeffery, Michael J. Franklin, and Alon Y. Halevy. 2008. Pay-as-you-go user feedback for dataspace systems. In SIGMOD. 847--860. Google ScholarDigital Library
- Yaguang Li, Han Su, Ugur Demiryurek, Bolong Zheng, Tieke He, and Cyrus Shahabi. 2017. PaRE: A System for Personalized Route Guidance. In WWW. 637--646. Google ScholarDigital Library
- Min Mun, Sasank Reddy, Katie Shilton, Nathan Yau, Jeff Burke, Deborah Estrin, Mark Hansen, Eric Howard, Ruth West, and Péter Boda. 2009. PEIR, the personal environmental impact report, as a platform for participatory sensing systems research. In MobiSys. 55--68. Google ScholarDigital Library
- Dan Olteanu, Jiewen Huang, and Christoph Koch. 2009. Sprout: Lazy vs. eager query plans for tuple-independent probabilistic databases. In ICDE. 640--651. Google ScholarDigital Library
- Aditya G Parameswaran, Hector Garcia-Molina, Hyunjung Park, Neoklis Polyzotis, Aditya Ramesh, and Jennifer Widom. 2012. Crowdscreen: Algorithms for filtering data with humans. In SIGMOD. 361--372. Google ScholarDigital Library
- Christopher Ré, Julie Letchner, Magdalena Balazinksa, and Dan Suciu. 2008. Event queries on correlated probabilistic streams. In SIGMOD. 715--728. Google ScholarDigital Library
- Sasank Reddy, Katie Shilton, Gleb Denisov, Christian Cenizal, Deborah Estrin, and Mani Srivastava. 2010. Biketastic: sensing and mapping for better biking. In CHI. 1817--1820. Google ScholarDigital Library
- Neil Rubens, Mehdi Elahi, Masashi Sugiyama, and Dain Kaplan. 2015. Active learning in recommender systems. In Recommender systems handbook. 809--846.Google Scholar
- Stuart J. Russell and Peter Norvig. 2003. Artificial Intelligence: A Modern Approach. Pearson Education. Google ScholarDigital Library
- Claude E Shannon. 2001. A mathematical theory of communication. ACM SIGMOBILE Mobile Computing and Communications Review (2001), 3--55. Google ScholarDigital Library
- Yufei Tao, Reynold Cheng, Xiaokui Xiao, Wang Kay Ngai, Ben Kao, and Sunil Prabhakar. 2005. Indexing multi-dimensional uncertain data with arbitrary probability density functions. In VLDB. 922--933. Google ScholarDigital Library
- Dong Wang, Lance Kaplan, Hieu Le, and Tarek Abdelzaher. 2012. On Truth Discovery in Social Sensing: A Maximum Likelihood Estimation Approach. In IPSN. 233--244. Google ScholarDigital Library
- Mohamed Yakout, Ahmed K Elmagarmid, Jennifer Neville, Mourad Ouzzani, and Ihab FIlyas. 2011. Guided data repair. In VLDB. 279--289. Google ScholarDigital Library
- Pengfei Zhou, Yuanqing Zheng, and Mo Li. 2012. How Long to Wait?: Predicting Bus Arrival Time with Mobile Phone Based Participatory Sensing. In MobiSys. 379--392. Google ScholarDigital Library
Index Terms
- Minimizing Efforts in Reconciling Participatory Sensing Data
Recommendations
Efficient User Guidance for Validating Participatory Sensing Data
Survey Papers and Regular PapersParticipatory sensing has become a new data collection paradigm that leverages the wisdom of the crowd for big data applications without spending cost to buy dedicated sensors. It collects data from human sensors by using their own devices such as cell ...
Participatory sensing for community building
CHI EA '11: CHI '11 Extended Abstracts on Human Factors in Computing SystemsIn this research, we explore the viability of using participatory sensing as a means to enhance a sense of community. To accomplish this, we are developing and deploying a suite of participatory sensing applications, where users explicitly report on the ...
Big data management in participatory sensing: Issues, trends and future directions
AbstractParticipatory sensing has become an emerging technology of this era owing to its low cost in big sensor data collection. Prior to participatory sensing, large-scale deployment complexities were found in wireless sensor networks when ...
Highlights- This paper highlights big data management issues in participatory sensing.
- We ...
Comments