skip to main content
10.1145/2534921.2534924acmconferencesArticle/Chapter ViewAbstractPublication PagesgisConference Proceedingsconference-collections
research-article

When big data meets big smog: a big spatio-temporal data framework for China severe smog analysis

Authors Info & Claims
Published:04 November 2013Publication History

ABSTRACT

Recently, the appearing disaster of severe smog has been attacking many cities in China such as the capital Beijing. The chief culprit of China smog, namely PM2.5, is affected by various factors including air pollutants, weather, climate, geographical location, urbanization, etc. To analyze the factors, we collect about 35,000,000 air quality records and about 30,000,000 weather records from the sensors in 77 China's cities in 2013. Moreover, two big data sets named Geoname and DBPedia are also combined for the data of climate, geographical location and urbanization. To deal with big spatio-temporal data for big smog analysis, we propose a MapReduce-based framework named BigSmog. It mainly conducts parallel correlation analysis of the factors and scalable training of artificial neural networks for spatio-temporal approximation of the concentration of PM2.5. In the experiments, BigSmog displays high scalability for big smog analysis with big spatio-temporal data. The analysis result shows that the air pollutants influence the short-term concentration of PM2.5 more than the weather and the factors of geographical location and climate rather than urbanization play a major role in determining a city's long-term pollution level of PM2.5. Moreover, the trained ANNs can accurately approximate the concentration of PM2.5.

References

  1. D. Borthakur. Hdfs architecture guide. Hadoop Apache Project. http://hadoop.apache.org/common/docs/current/hdfs_design.pdf, 2008.Google ScholarGoogle Scholar
  2. H. Che, X. Zhang, Y. Li, Z. Zhou, J. Qu, and X. Hao. Haze trends over the capital cities of 31 provinces in china, 1981âĂŞ2005. Theoretical and Applied Climatology, 97(3-4):235--242, 2009.Google ScholarGoogle ScholarCross RefCross Ref
  3. D. A. Chu, Y. Kaufman, G. Zibordi, J. Chern, J. Mao, C. Li, and B. Holben. Global monitoring of air pollution over land from the earth observing system-terra moderate resolution imaging spectroradiometer (modis). Journal of Geophysical Research: Atmospheres (1984--2012), 108(D21), 2003.Google ScholarGoogle Scholar
  4. J. Dean and S. Ghemawat. Mapreduce: simplified data processing on large clusters. Communications of the ACM, 51(1):107--113, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. D. Dominick, H. Juahir, M. T. Latif, S. M. Zain, and A. Z. Aris. Spatial assessment of air quality patterns in malaysia using multivariate analysis. Atmospheric Environment, 60(0):172--181, 2012.Google ScholarGoogle ScholarCross RefCross Ref
  6. G. Grivas and A. Chaloulakou. Artificial neural network models for prediction of {PM10} hourly concentrations, in the greater area of athens, greece. Atmospheric Environment, 40(7):1216--1229, 2006.Google ScholarGoogle ScholarCross RefCross Ref
  7. D. Hasenfratz, O. Saukh, S. Sturzenegger, and L. Thiele. Participatory air pollution monitoring using smartphones. In Proc. 1st IntâĂŹl Workshop on Mobile Sensing: From Smartphones and Wearables to Big Data, 2012.Google ScholarGoogle Scholar
  8. J. Hooyberghs, C. Mensink, G. Dumont, F. Fierens, and O. Brasseur. A neural network forecast for daily average {PM10} concentrations in belgium. Atmospheric Environment, 39(18):3279--3289, 2005.Google ScholarGoogle ScholarCross RefCross Ref
  9. G.-B. Huang, Q.-Y. Zhu, and C.-K. Siew. Extreme learning machine: a new learning scheme of feedforward neural networks. In International Symposium on Neural Networks, volume 2, 2004.Google ScholarGoogle Scholar
  10. G.-B. Huang, Q.-Y. Zhu, and C.-K. Siew. Extreme learning machine: Theory and applications. Neurocomputing, 70(1âĂŞ3):489--501, 2006.Google ScholarGoogle Scholar
  11. H. Jing, X. Lihua, G. Yuling, and W. Xiaoxuan. Remote sensing monitoring on photochemical pollution caused by haze in pearl river delta. In Urban Remote Sensing Joint Event, 2009.Google ScholarGoogle Scholar
  12. O. Kunii, S. Kanagawa, I. Yajima, Y. Hisamatsu, S. Yamamura, T. Amagai, and I. T. S. Ismail. The 1997 haze disaster in indonesia: Its air quality and health effects. Archives of Environmental Health: An International Journal, 57(1):16--22, 2002. PMID: 12071356.Google ScholarGoogle Scholar
  13. J. Lee Rodgers and W. A. Nicewander. Thirteen ways to look at the correlation coefficient. The American Statistician, 42(1):59--66, 1988.Google ScholarGoogle ScholarCross RefCross Ref
  14. S. Li, L. Chen, F. Zheng, D. Han, and Z. Wang. Design and application of haze optic thickness retrieval model for beijing olympic games. In Geoscience and Remote Sensing Symposium, 2009 IEEE International, IGARSS 2009, volume 2, pages II--507--II--510, 2009.Google ScholarGoogle Scholar
  15. I. G. McKendry. Evaluation of artificial neural networks for fine particulate pollution (pm10 and pm2.5) forecasting. Journal of the Air & Waste Management Association, 52(9):1096--1101, 2002.Google ScholarGoogle ScholarCross RefCross Ref
  16. M. Patience. Beijing smog: When growth trumps life in china. BBC News Magazine, 2013.Google ScholarGoogle Scholar
  17. Y. Sun, G. Zhuang, A. Tang, Y. Wang, and Z. An. Chemical characteristics of pm2. 5 and pm10 in haze-fog episodes in beijing. Environmental science & technology, 40(10):3148--3155, 2006.Google ScholarGoogle Scholar
  18. S. N. Syed Abdul Mutalib, H. Juahir, A. Azid, S. Mohammad Sharif, M. T. Latif, A. Z. Aris, S. Md Zain, and D. Dominick. Spatial and temporal air quality pattern recognition using environmetric techniques: a case study in malaysia. Environ. Sci.: Processes Impacts, pages --, 2013.Google ScholarGoogle Scholar
  19. J.-H. Tan, J.-C. Duan, D.-H. Chen, X.-H. Wang, S.-J. Guo, X.-H. Bi, G.-Y. Sheng, K.-B. He, and J.-M. Fu. Chemical characteristics of haze during summer and winter in guangzhou. Atmospheric Research, 94(2):238--245, 2009.Google ScholarGoogle ScholarCross RefCross Ref
  20. A. Van Donkelaar, R. V. Martin, and R. J. Park. Estimating ground-level pm2. 5 using aerosol optical depth determined from satellite remote sensing. Journal of Geophysical Research: Atmospheres (1984--2012), 111(D21), 2006.Google ScholarGoogle Scholar
  21. S. Vardoulakis, B. E. Fisher, K. Pericleous, and N. Gonzalez-Flesca. Modelling air quality in street canyons: a review. Atmospheric Environment, 37(2):155--182, 2003.Google ScholarGoogle ScholarCross RefCross Ref
  22. C. Xu, Z. Wang, S. Li, and H. Chen. A haze monitoring over north china plain. In Geoscience and Remote Sensing Symposium (IGARSS), 2012 IEEE International, pages 2474--2477, 2012.Google ScholarGoogle ScholarCross RefCross Ref
  23. Y. Zheng, F. Liu, and H.-P. Hsie. U-air: When urban air quality inference meets big data. 2013.Google ScholarGoogle Scholar

Index Terms

  1. When big data meets big smog: a big spatio-temporal data framework for China severe smog analysis

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Conferences
            BigSpatial '13: Proceedings of the 2nd ACM SIGSPATIAL International Workshop on Analytics for Big Geospatial Data
            November 2013
            86 pages
            ISBN:9781450325349
            DOI:10.1145/2534921

            Copyright © 2013 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 4 November 2013

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • research-article

            Acceptance Rates

            Overall Acceptance Rate32of58submissions,55%

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader