Hadoop-based distributed system for online prediction of air pollution based on Support Vector Machine
(2015) ISPRS International Conference on Sensors and Models in Remote Sensing and Photogrammetry 2015 In International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives 40. p.215-219- Abstract
The critical impact of air pollution on human health and environment in one hand and the complexity of pollutant concentration behavior in the other hand lead the scientists to look for advance techniques for monitoring and predicting the urban air quality. Additionally, recent developments in data measurement techniques have led to collection of various types of data about air quality. Such data is extremely voluminous and to be useful it must be processed at high velocity. Due to the complexity of big data analysis especially for dynamic applications, online forecasting of pollutant concentration trends within a reasonable processing time is still an open problem. The purpose of this paper is to present an online forecasting approach... (More)
The critical impact of air pollution on human health and environment in one hand and the complexity of pollutant concentration behavior in the other hand lead the scientists to look for advance techniques for monitoring and predicting the urban air quality. Additionally, recent developments in data measurement techniques have led to collection of various types of data about air quality. Such data is extremely voluminous and to be useful it must be processed at high velocity. Due to the complexity of big data analysis especially for dynamic applications, online forecasting of pollutant concentration trends within a reasonable processing time is still an open problem. The purpose of this paper is to present an online forecasting approach based on Support Vector Machine (SVM) to predict the air quality one day in advance. In order to overcome the computational requirements for large-scale data analysis, distributed computing based on the Hadoop platform has been employed to leverage the processing power of multiple processing units. The MapReduce programming model is adopted for massive parallel processing in this study. Based on the online algorithm and Hadoop framework, an online forecasting system is designed to predict the air pollution of Tehran for the next 24 hours. The results have been assessed on the basis of Processing Time and Efficiency. Quite accurate predictions of air pollutant indicator levels within an acceptable processing time prove that the presented approach is very suitable to tackle large scale air pollution prediction problems.
(Less)
- author
- Ghaemi, Z. ; Farnaghi, M. LU and Alimohammadi, A.
- publishing date
- 2015
- type
- Chapter in Book/Report/Conference proceeding
- publication status
- published
- keywords
- Big data, Distributed computing, Online prediction, Spatial analysis, Support Vector Machine, Urban air pollution
- host publication
- ISPRS International Conference on Sensors and Models in Remote Sensing and Photogrammetry 2015
- series title
- International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives
- volume
- 40
- edition
- 1W5
- pages
- 5 pages
- conference name
- ISPRS International Conference on Sensors and Models in Remote Sensing and Photogrammetry 2015
- conference location
- Kish Island, Iran, Islamic Republic of
- conference dates
- 2015-11-23 - 2015-11-25
- external identifiers
-
- scopus:84974578128
- ISSN
- 1682-1750
- DOI
- 10.5194/isprsarchives-XL-1-W5-215-2015
- language
- English
- LU publication?
- no
- id
- 64d79e8d-f557-432e-a8b7-ac3484e1d331
- date added to LUP
- 2019-05-06 12:12:43
- date last changed
- 2022-03-10 03:53:31
@inproceedings{64d79e8d-f557-432e-a8b7-ac3484e1d331, abstract = {{<p>The critical impact of air pollution on human health and environment in one hand and the complexity of pollutant concentration behavior in the other hand lead the scientists to look for advance techniques for monitoring and predicting the urban air quality. Additionally, recent developments in data measurement techniques have led to collection of various types of data about air quality. Such data is extremely voluminous and to be useful it must be processed at high velocity. Due to the complexity of big data analysis especially for dynamic applications, online forecasting of pollutant concentration trends within a reasonable processing time is still an open problem. The purpose of this paper is to present an online forecasting approach based on Support Vector Machine (SVM) to predict the air quality one day in advance. In order to overcome the computational requirements for large-scale data analysis, distributed computing based on the Hadoop platform has been employed to leverage the processing power of multiple processing units. The MapReduce programming model is adopted for massive parallel processing in this study. Based on the online algorithm and Hadoop framework, an online forecasting system is designed to predict the air pollution of Tehran for the next 24 hours. The results have been assessed on the basis of Processing Time and Efficiency. Quite accurate predictions of air pollutant indicator levels within an acceptable processing time prove that the presented approach is very suitable to tackle large scale air pollution prediction problems.</p>}}, author = {{Ghaemi, Z. and Farnaghi, M. and Alimohammadi, A.}}, booktitle = {{ISPRS International Conference on Sensors and Models in Remote Sensing and Photogrammetry 2015}}, issn = {{1682-1750}}, keywords = {{Big data; Distributed computing; Online prediction; Spatial analysis; Support Vector Machine; Urban air pollution}}, language = {{eng}}, pages = {{215--219}}, series = {{International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives}}, title = {{Hadoop-based distributed system for online prediction of air pollution based on Support Vector Machine}}, url = {{http://dx.doi.org/10.5194/isprsarchives-XL-1-W5-215-2015}}, doi = {{10.5194/isprsarchives-XL-1-W5-215-2015}}, volume = {{40}}, year = {{2015}}, }