Human-induced arsenic pollution modeling in surface waters : An integrated approach using machine learning algorithms and environmental factors
(2022) In Journal of Environmental Management 305.- Abstract
- In recent years, assessment of sediment contamination by heavy metals, i.e., arsenic, has attracted the interest of scientists worldwide. The present study provides a new methodology to better understand the factors influencing surface water vulnerability to arsenic pollution by two advanced machine learning algorithms including boosted regression trees (BRT) and random forest (RF). Based on the sediment quality guidelines (Effects range low) polluted and non-polluted arsenic sediment samples were defined with concentrations >8 ppm and <8 ppm, respectively. Different conditioning factors such as topographical, lithology, erosion, hydrological, and anthro- pogenic factors were acquired to model surface waters’ vulnerability to... (More)
- In recent years, assessment of sediment contamination by heavy metals, i.e., arsenic, has attracted the interest of scientists worldwide. The present study provides a new methodology to better understand the factors influencing surface water vulnerability to arsenic pollution by two advanced machine learning algorithms including boosted regression trees (BRT) and random forest (RF). Based on the sediment quality guidelines (Effects range low) polluted and non-polluted arsenic sediment samples were defined with concentrations >8 ppm and <8 ppm, respectively. Different conditioning factors such as topographical, lithology, erosion, hydrological, and anthro- pogenic factors were acquired to model surface waters’ vulnerability to arsenic. We trained and validated the models using 70 and 30% of both polluted and non-polluted samples, respectively, and generated surface vulnerability maps. To verify the maps to arsenic pollution, the receiver operating characteristics (ROC) curve was implemented. The results approved the acceptable performance of the RF and BRT algorithms with an area under ROC values of 85% and 75.6%, respectively. Further, the findings showed higher importance of precipi- tation, slope aspect, distance from residential areas, and slope length in arsenic pollution in the modeling pro- cess. Erosion, lithology, and land use maps were introduced as the least important factors. The introduced methodology can be used to define the most vulnerable areas to arsenic pollution in advance and implement proper remediation actions to reduce the damages. (Less)
Please use this url to cite or link to this publication:
https://lup.lub.lu.se/record/d4ecc18f-dedb-465c-a974-dbe7c22f0b46
- author
- Mohammadi, Maziar ; Naghibi, Seyed Amir LU ; Motevalli, Alireza LU and Hashemi, Hossein LU
- organization
- publishing date
- 2022
- type
- Contribution to journal
- publication status
- published
- subject
- in
- Journal of Environmental Management
- volume
- 305
- pages
- 12 pages
- publisher
- Elsevier
- external identifiers
-
- scopus:85121592217
- pmid:34954681
- ISSN
- 0301-4797
- DOI
- 10.1016/j.jenvman.2021.114347
- language
- English
- LU publication?
- yes
- id
- d4ecc18f-dedb-465c-a974-dbe7c22f0b46
- date added to LUP
- 2022-02-11 14:44:36
- date last changed
- 2023-10-09 00:44:23
@article{d4ecc18f-dedb-465c-a974-dbe7c22f0b46, abstract = {{In recent years, assessment of sediment contamination by heavy metals, i.e., arsenic, has attracted the interest of scientists worldwide. The present study provides a new methodology to better understand the factors influencing surface water vulnerability to arsenic pollution by two advanced machine learning algorithms including boosted regression trees (BRT) and random forest (RF). Based on the sediment quality guidelines (Effects range low) polluted and non-polluted arsenic sediment samples were defined with concentrations >8 ppm and <8 ppm, respectively. Different conditioning factors such as topographical, lithology, erosion, hydrological, and anthro- pogenic factors were acquired to model surface waters’ vulnerability to arsenic. We trained and validated the models using 70 and 30% of both polluted and non-polluted samples, respectively, and generated surface vulnerability maps. To verify the maps to arsenic pollution, the receiver operating characteristics (ROC) curve was implemented. The results approved the acceptable performance of the RF and BRT algorithms with an area under ROC values of 85% and 75.6%, respectively. Further, the findings showed higher importance of precipi- tation, slope aspect, distance from residential areas, and slope length in arsenic pollution in the modeling pro- cess. Erosion, lithology, and land use maps were introduced as the least important factors. The introduced methodology can be used to define the most vulnerable areas to arsenic pollution in advance and implement proper remediation actions to reduce the damages.}}, author = {{Mohammadi, Maziar and Naghibi, Seyed Amir and Motevalli, Alireza and Hashemi, Hossein}}, issn = {{0301-4797}}, language = {{eng}}, publisher = {{Elsevier}}, series = {{Journal of Environmental Management}}, title = {{Human-induced arsenic pollution modeling in surface waters : An integrated approach using machine learning algorithms and environmental factors}}, url = {{http://dx.doi.org/10.1016/j.jenvman.2021.114347}}, doi = {{10.1016/j.jenvman.2021.114347}}, volume = {{305}}, year = {{2022}}, }