DOI: 10.1145/3227609.3227678
research-article

Minimizing Efforts in Reconciling Participatory Sensing Data

Published: 25 June 2018

ABSTRACT

Participatory sensing has emerged as a new data collection paradigm in which people use their own devices (cell phone accelerometers, cameras, etc.) as sensors. This paradigm makes it possible to collect a huge amount of data from the crowd for world-wide applications without the cost of buying dedicated sensors. Despite this benefit, the data collected from human sensors are inherently uncertain because participants offer no quality guarantee. Moreover, participatory sensing data are time series that not only exhibit highly irregular dependencies on time but also vary from sensor to sensor. To overcome these issues, we study the problem of reconciling probabilistic data from given (uncertain) time series collected by participatory sensors. More precisely, we execute an iterative process that alternates between two mutually reinforcing routines: (i) aggregating probabilistic time series from multiple sensors and expert input, and (ii) validating them with expert knowledge at minimal effort. Through extensive experimentation, we demonstrate the efficiency and effectiveness of our approach on both real and synthetic data.
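The abstract does not spell out how the two routines are implemented, so the following is only a minimal sketch of one way such an aggregate-then-validate loop could look. Everything in it is an assumption made for illustration: the function names (`aggregate`, `entropy`, `reconcile`), the representation of each sensor's series as a list of label-to-probability dictionaries, the use of plain averaging for aggregation, and the choice of Shannon entropy to decide which time step to show the expert next. None of these should be read as the paper's actual method.

```python
import math


def aggregate(sensor_series, validated):
    """Combine per-sensor distributions at each time step by simple averaging.

    Assumption: every sensor reports a {label: probability} dict per time step.
    Time steps already validated by the expert are fixed to the expert's label.
    """
    n_steps = len(sensor_series[0])
    labels = list(sensor_series[0][0].keys())
    combined = []
    for t in range(n_steps):
        if t in validated:
            combined.append({l: 1.0 if l == validated[t] else 0.0 for l in labels})
        else:
            combined.append(
                {l: sum(s[t][l] for s in sensor_series) / len(sensor_series)
                 for l in labels}
            )
    return combined


def entropy(dist):
    """Shannon entropy (in bits) of a discrete distribution."""
    return -sum(p * math.log2(p) for p in dist.values() if p > 0)


def reconcile(sensor_series, ask_expert, budget):
    """Alternate aggregation and expert validation for at most `budget` questions.

    Assumption: the expert is asked about the currently most uncertain time step.
    `ask_expert` is a hypothetical callback mapping a time step to its true label.
    """
    validated = {}
    for _ in range(budget):
        combined = aggregate(sensor_series, validated)
        candidates = [t for t in range(len(combined)) if t not in validated]
        if not candidates:
            break
        target = max(candidates, key=lambda t: entropy(combined[t]))
        if entropy(combined[target]) == 0:
            break  # nothing uncertain is left to validate
        validated[target] = ask_expert(target)
    return aggregate(sensor_series, validated), validated


# Toy usage: two sensors reporting traffic state over three time steps.
sensor_a = [{"jam": 0.9, "free": 0.1}, {"jam": 0.5, "free": 0.5}, {"jam": 0.2, "free": 0.8}]
sensor_b = [{"jam": 0.8, "free": 0.2}, {"jam": 0.6, "free": 0.4}, {"jam": 0.1, "free": 0.9}]
truth = {0: "jam", 1: "free", 2: "free"}

final, asked = reconcile([sensor_a, sensor_b], lambda t: truth[t], budget=1)
```

With a budget of one question, this sketch spends it on the middle time step, where the two sensors are closest to an even split, and leaves the other steps as averaged distributions.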

        • Published in

          WIMS '18: Proceedings of the 8th International Conference on Web Intelligence, Mining and Semantics
          June 2018
          398 pages

          Copyright © 2018 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States


          Qualifiers

          • research-article
          • Research
          • Refereed limited

          Acceptance Rates

          Overall Acceptance Rate: 140 of 278 submissions, 50%
