Skip to main content

Lund University Publications

LUND UNIVERSITY LIBRARIES

Human-induced arsenic pollution modeling in surface waters : An integrated approach using machine learning algorithms and environmental factors

Mohammadi, Maziar ; Naghibi, Seyed Amir LU ; Motevalli, Alireza LU and Hashemi, Hossein LU orcid (2022) In Journal of Environmental Management 305.
Abstract
In recent years, assessment of sediment contamination by heavy metals, i.e., arsenic, has attracted the interest of scientists worldwide. The present study provides a new methodology to better understand the factors influencing surface water vulnerability to arsenic pollution by two advanced machine learning algorithms including boosted regression trees (BRT) and random forest (RF). Based on the sediment quality guidelines (Effects range low) polluted and non-polluted arsenic sediment samples were defined with concentrations >8 ppm and <8 ppm, respectively. Different conditioning factors such as topographical, lithology, erosion, hydrological, and anthro- pogenic factors were acquired to model surface waters’ vulnerability to... (More)
In recent years, assessment of sediment contamination by heavy metals, i.e., arsenic, has attracted the interest of scientists worldwide. The present study provides a new methodology to better understand the factors influencing surface water vulnerability to arsenic pollution by two advanced machine learning algorithms including boosted regression trees (BRT) and random forest (RF). Based on the sediment quality guidelines (Effects range low) polluted and non-polluted arsenic sediment samples were defined with concentrations >8 ppm and <8 ppm, respectively. Different conditioning factors such as topographical, lithology, erosion, hydrological, and anthro- pogenic factors were acquired to model surface waters’ vulnerability to arsenic. We trained and validated the models using 70 and 30% of both polluted and non-polluted samples, respectively, and generated surface vulnerability maps. To verify the maps to arsenic pollution, the receiver operating characteristics (ROC) curve was implemented. The results approved the acceptable performance of the RF and BRT algorithms with an area under ROC values of 85% and 75.6%, respectively. Further, the findings showed higher importance of precipi- tation, slope aspect, distance from residential areas, and slope length in arsenic pollution in the modeling pro- cess. Erosion, lithology, and land use maps were introduced as the least important factors. The introduced methodology can be used to define the most vulnerable areas to arsenic pollution in advance and implement proper remediation actions to reduce the damages. (Less)
Please use this url to cite or link to this publication:
author
; ; and
organization
publishing date
type
Contribution to journal
publication status
published
subject
in
Journal of Environmental Management
volume
305
pages
12 pages
publisher
Elsevier
external identifiers
  • scopus:85121592217
  • pmid:34954681
ISSN
0301-4797
DOI
10.1016/j.jenvman.2021.114347
language
English
LU publication?
yes
id
d4ecc18f-dedb-465c-a974-dbe7c22f0b46
date added to LUP
2022-02-11 14:44:36
date last changed
2023-10-09 00:44:23
@article{d4ecc18f-dedb-465c-a974-dbe7c22f0b46,
  abstract     = {{In recent years, assessment of sediment contamination by heavy metals, i.e., arsenic, has attracted the interest of scientists worldwide. The present study provides a new methodology to better understand the factors influencing surface water vulnerability to arsenic pollution by two advanced machine learning algorithms including boosted regression trees (BRT) and random forest (RF). Based on the sediment quality guidelines (Effects range low) polluted and non-polluted arsenic sediment samples were defined with concentrations &gt;8 ppm and &lt;8 ppm, respectively. Different conditioning factors such as topographical, lithology, erosion, hydrological, and anthro- pogenic factors were acquired to model surface waters’ vulnerability to arsenic. We trained and validated the models using 70 and 30% of both polluted and non-polluted samples, respectively, and generated surface vulnerability maps. To verify the maps to arsenic pollution, the receiver operating characteristics (ROC) curve was implemented. The results approved the acceptable performance of the RF and BRT algorithms with an area under ROC values of 85% and 75.6%, respectively. Further, the findings showed higher importance of precipi- tation, slope aspect, distance from residential areas, and slope length in arsenic pollution in the modeling pro- cess. Erosion, lithology, and land use maps were introduced as the least important factors. The introduced methodology can be used to define the most vulnerable areas to arsenic pollution in advance and implement proper remediation actions to reduce the damages.}},
  author       = {{Mohammadi, Maziar and Naghibi, Seyed Amir and Motevalli, Alireza and Hashemi, Hossein}},
  issn         = {{0301-4797}},
  language     = {{eng}},
  publisher    = {{Elsevier}},
  series       = {{Journal of Environmental Management}},
  title        = {{Human-induced arsenic pollution modeling in surface waters : An integrated approach using machine learning algorithms and environmental factors}},
  url          = {{http://dx.doi.org/10.1016/j.jenvman.2021.114347}},
  doi          = {{10.1016/j.jenvman.2021.114347}},
  volume       = {{305}},
  year         = {{2022}},
}