skip to main content
10.1145/3124680.3124746acmconferencesArticle/Chapter ViewAbstractPublication PagesapsysConference Proceedingsconference-collections
research-article

Scaling Up IoT Stream Processing

Published:02 September 2017Publication History

ABSTRACT

Users create large numbers of IoT stream queries with data streams generated from various IoT devices. Current stream processing systems such as Storm and Flink are unable to support such large numbers of IoT stream queries efficiently, as their execution models cause a flurry of cache misses while processing the events of the queries. To solve this problem, we present a new group-aware execution model, which processes the events of IoT stream queries in a way that exploits the locality of data and code references, to reduce cache misses and improve system performance. The group-aware execution model leverages the fact that users create the groups of queries according to their interests or location contexts and that queries in the same group can share the same data and codes. We realize the group-aware execution model on MIST---a new stream processing system tailored for processing many IoT stream queries efficiently---to scale up the number of IoT queries that can be processed in a machine. Our preliminary evaluation shows that our group-aware execution model increases the number of queries that can be processed within a single machine up to 3.18X compared to the Flink-based execution model.

References

  1. Daniel J Abadi, Yanif Ahmad, Magdalena Balazinska, Ugur Cetintemel, Mitch Cherniack, Jeong-Hyon Hwang, Wolfgang Lindner, Anurag Maskey, Alex Rasin, Esther Ryvkina, et al. 2005. The Design of the Borealis Stream Processing Engine. In CIDR.Google ScholarGoogle Scholar
  2. Daniel J Abadi, Don Carney, Ugur Çetintemel, Mitch Cherniack, Christian Convey, Sangdon Lee, Michael Stonebraker, Nesime Tatbul, and Stan Zdonik. 2003. Aurora: a new model and architecture for data stream management. VLDB 12, 2 (2003), 120--139.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Arvind Arasu, Brian Babcock, Shivnath Babu, Mayur Datar, Keith Ito, Itaru Nishizawa, Justin Rosenstein, and Jennifer Widom. 2003. STREAM: the stanford stream data manager (demonstration description). In ACM SIGMOD.Google ScholarGoogle Scholar
  4. Sirish Chandrasekaran, Owen Cooper, Amol Deshpande, Michael J Franklin, Joseph M Hellerstein, Wei Hong, Sailesh Krishnamurthy, Samuel R Madden, Fred Reiss, and Mehul A Shah. 2003. TelegraphCQ: continuous dataflow processing. In ACM SIGMOD.Google ScholarGoogle Scholar
  5. Jianjun Chen, David J DeWitt, Feng Tian, and Yuan Wang. 2000. NiagaraCQ: A scalable continuous query system for internet databases. In ACM SIGMOD.Google ScholarGoogle Scholar
  6. Robert Gallager Dimitri Bertsekas. 1992. Data Networks (2nd ed.). Prentice Hall.Google ScholarGoogle Scholar
  7. EMQ Enterprise. 2017. EMQ - Erlang MQTT Broker. http://emqtt.io/docs/v2/index.html. (2017).Google ScholarGoogle Scholar
  8. Apache Flink. 2017. Apache Flink: Scalable Stream and Batch Data Processing. https://flink.apache.org. (2017).Google ScholarGoogle Scholar
  9. Lukasz Golab, Kumar Gaurav Bijay, and M Tamer Özsu. 2006. Multiquery optimization of sliding window aggregates by schedule synchronization. In CIKM.Google ScholarGoogle Scholar
  10. Mahanth Gowda, Ashutosh Dhekne, Sheng Shen, Romit Roy Choudhury, Lei Yang, Suresh Golwalkar, and Alexander Essanian. 2017. Bringing IoT to Sports Analytics. In NSDI.Google ScholarGoogle Scholar
  11. Trinabh Gupta, Rayman Preet Singh, Amar Phanishayee, Jaeyeon Jung, and Ratul Mahajan. 2014. Bolt: Data Management for Connected Homes.. In NSDI.Google ScholarGoogle Scholar
  12. IFTTT. 2017. IFTTT. https://ifttt.com/about. (2017).Google ScholarGoogle Scholar
  13. A. Khanna and R. Anand. 2016. IoT based smart parking system. In IOTA. Google ScholarGoogle ScholarCross RefCross Ref
  14. J. Kreps, N. Narkhede, and J. Rao. 2011. Kafka: A distributed messaging system for log processing. In NetDB.Google ScholarGoogle Scholar
  15. Nest Labs. 2017. Nest. https://nest.com/. (2017).Google ScholarGoogle Scholar
  16. Samuel Madden, Michael J Franklin, Joseph M Hellerstein, and Wei Hong. 2002. TAG: A tiny aggregation service for ad-hoc sensor networks. In OSDI.Google ScholarGoogle ScholarCross RefCross Ref
  17. Samuel Madden, Mehul Shah, Joseph M Hellerstein, and Vijayshankar Raman. 2002. Continuously adaptive continuous queries over streams. In ACM SIGMOD.Google ScholarGoogle Scholar
  18. Samuel R Madden, Michael J Franklin, Joseph M Hellerstein, and Wei Hong. 2005. TinyDB: an acquisitional query processing system for sensor networks. ACM TODS 30, 1 (2005), 122--173.Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Microsoft. 2017. Azure IoT Suite. https://www.microsoft.com/en-us/cloud-platform/internet-of-things-azure-iot-suite. (2017).Google ScholarGoogle Scholar
  20. KVM Naidu, Rajeev Rastogi, Scott Satkin, and Anand Srinivasan. 2011. Memory-constrained aggregate computation over data streams. In ICDE.Google ScholarGoogle Scholar
  21. Attila Reiss and Didier Stricker. 2012. Creating and benchmarking a new dataset for physical activity monitoring. In ACM PETRA.Google ScholarGoogle Scholar
  22. Amazon Web Services. 2017. AWS Internet of Things. https://aws.amazon.com/iot/?nc1=h_ls. (2017).Google ScholarGoogle Scholar
  23. Ankit Toshniwal, Siddarth Taneja, Amit Shukla, Karthik Ramasamy, Jignesh M Patel, Sanjeev Kulkarni, Jason Jackson, Krishna Gade, Maosong Fu, Jake Donham, et al. 2014. Storm@ twitter. In ACM SIGMOD.Google ScholarGoogle Scholar
  24. Deepak Vasisht, Zerina Kapetanovic, Jongho Won, Xinxin Jin, Ranveer Chandra, Sudipta Sinha, Ashish Kapoor, Madhusudhan Sudarshan, and Sean Stratman. 2017. FarmBeats: An IoT Platform for Data-Driven Agriculture. In NSDI.Google ScholarGoogle Scholar
  25. Song Wang, Elke Rundensteiner, Samrat Ganguly, and Sudeept Bhatnagar. 2006. State-slice: New paradigm of multi-query optimization of window-based stream queries. In VLDB.Google ScholarGoogle Scholar
  26. Matt Welsh, David Culler, and Eric Brewer. 2001. SEDA: an architecture for well-conditioned, scalable internet services. In SIGOPS. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Matei Zaharia, Tathagata Das, Haoyuan Li, Timothy Hunter, Scott Shenker, and Ion Stoica. 2013. Discretized streams: Fault-tolerant streaming computation at scale. In SOSP.Google ScholarGoogle Scholar
  28. Rui Zhang, Nick Koudas, Beng Chin Ooi, and Divesh Srivastava. 2005. Multiple aggregations over data streams. In ACM SIGMOD.Google ScholarGoogle Scholar
  29. Rui Zhang, Nick Koudas, Beng Chin Ooi, Divesh Srivastava, and Pu Zhou. 2010. Streaming multiple aggregations using phantoms. VLDB 19, 4 (2010), 557--583. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Yu Zheng, Xing Xie, and Wei-Ying Ma. 2010. GeoLife: A Collaborative Social Networking Service among User, Location and Trajectory. IEEE Data Eng. Bull. 33, 2 (2010), 32--39.Google ScholarGoogle Scholar

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Conferences
    APSys '17: Proceedings of the 8th Asia-Pacific Workshop on Systems
    September 2017
    207 pages
    ISBN:9781450351973
    DOI:10.1145/3124680

    Copyright © 2017 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 2 September 2017

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • research-article
    • Research
    • Refereed limited

    Acceptance Rates

    APSys '17 Paper Acceptance Rate27of51submissions,53%Overall Acceptance Rate149of386submissions,39%

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader