Skip to main content

Lund University Publications

LUND UNIVERSITY LIBRARIES

Hadoop-based distributed system for online prediction of air pollution based on Support Vector Machine

Ghaemi, Z. ; Farnaghi, M. LU and Alimohammadi, A. (2015) ISPRS International Conference on Sensors and Models in Remote Sensing and Photogrammetry 2015 In International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives 40. p.215-219
Abstract

The critical impact of air pollution on human health and environment in one hand and the complexity of pollutant concentration behavior in the other hand lead the scientists to look for advance techniques for monitoring and predicting the urban air quality. Additionally, recent developments in data measurement techniques have led to collection of various types of data about air quality. Such data is extremely voluminous and to be useful it must be processed at high velocity. Due to the complexity of big data analysis especially for dynamic applications, online forecasting of pollutant concentration trends within a reasonable processing time is still an open problem. The purpose of this paper is to present an online forecasting approach... (More)

The critical impact of air pollution on human health and environment in one hand and the complexity of pollutant concentration behavior in the other hand lead the scientists to look for advance techniques for monitoring and predicting the urban air quality. Additionally, recent developments in data measurement techniques have led to collection of various types of data about air quality. Such data is extremely voluminous and to be useful it must be processed at high velocity. Due to the complexity of big data analysis especially for dynamic applications, online forecasting of pollutant concentration trends within a reasonable processing time is still an open problem. The purpose of this paper is to present an online forecasting approach based on Support Vector Machine (SVM) to predict the air quality one day in advance. In order to overcome the computational requirements for large-scale data analysis, distributed computing based on the Hadoop platform has been employed to leverage the processing power of multiple processing units. The MapReduce programming model is adopted for massive parallel processing in this study. Based on the online algorithm and Hadoop framework, an online forecasting system is designed to predict the air pollution of Tehran for the next 24 hours. The results have been assessed on the basis of Processing Time and Efficiency. Quite accurate predictions of air pollutant indicator levels within an acceptable processing time prove that the presented approach is very suitable to tackle large scale air pollution prediction problems.

(Less)
Please use this url to cite or link to this publication:
author
; and
publishing date
type
Chapter in Book/Report/Conference proceeding
publication status
published
keywords
Big data, Distributed computing, Online prediction, Spatial analysis, Support Vector Machine, Urban air pollution
host publication
ISPRS International Conference on Sensors and Models in Remote Sensing and Photogrammetry 2015
series title
International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives
volume
40
edition
1W5
pages
5 pages
conference name
ISPRS International Conference on Sensors and Models in Remote Sensing and Photogrammetry 2015
conference location
Kish Island, Iran, Islamic Republic of
conference dates
2015-11-23 - 2015-11-25
external identifiers
  • scopus:84974578128
ISSN
1682-1750
DOI
10.5194/isprsarchives-XL-1-W5-215-2015
language
English
LU publication?
no
id
64d79e8d-f557-432e-a8b7-ac3484e1d331
date added to LUP
2019-05-06 12:12:43
date last changed
2022-03-10 03:53:31
@inproceedings{64d79e8d-f557-432e-a8b7-ac3484e1d331,
  abstract     = {{<p>The critical impact of air pollution on human health and environment in one hand and the complexity of pollutant concentration behavior in the other hand lead the scientists to look for advance techniques for monitoring and predicting the urban air quality. Additionally, recent developments in data measurement techniques have led to collection of various types of data about air quality. Such data is extremely voluminous and to be useful it must be processed at high velocity. Due to the complexity of big data analysis especially for dynamic applications, online forecasting of pollutant concentration trends within a reasonable processing time is still an open problem. The purpose of this paper is to present an online forecasting approach based on Support Vector Machine (SVM) to predict the air quality one day in advance. In order to overcome the computational requirements for large-scale data analysis, distributed computing based on the Hadoop platform has been employed to leverage the processing power of multiple processing units. The MapReduce programming model is adopted for massive parallel processing in this study. Based on the online algorithm and Hadoop framework, an online forecasting system is designed to predict the air pollution of Tehran for the next 24 hours. The results have been assessed on the basis of Processing Time and Efficiency. Quite accurate predictions of air pollutant indicator levels within an acceptable processing time prove that the presented approach is very suitable to tackle large scale air pollution prediction problems.</p>}},
  author       = {{Ghaemi, Z. and Farnaghi, M. and Alimohammadi, A.}},
  booktitle    = {{ISPRS International Conference on Sensors and Models in Remote Sensing and Photogrammetry 2015}},
  issn         = {{1682-1750}},
  keywords     = {{Big data; Distributed computing; Online prediction; Spatial analysis; Support Vector Machine; Urban air pollution}},
  language     = {{eng}},
  pages        = {{215--219}},
  series       = {{International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives}},
  title        = {{Hadoop-based distributed system for online prediction of air pollution based on Support Vector Machine}},
  url          = {{http://dx.doi.org/10.5194/isprsarchives-XL-1-W5-215-2015}},
  doi          = {{10.5194/isprsarchives-XL-1-W5-215-2015}},
  volume       = {{40}},
  year         = {{2015}},
}