A case study using support vector machines, neural networks and logistic regression in a GIS to identify wells contaminated with nitrate-N |
| |
Authors: | Barnali Dixon |
| |
Affiliation: | 1. Geospatial Analytics Lab, Dept. of Environmental Science and Policy, University of South Florida St. Petersburg, 140 Seventh Ave South, PNM 105, St. Petersburg, FL, 33701, USA
|
| |
Abstract: | Accurate and inexpensive identification of potentially contaminated wells is critical for water resources protection and management. The objectives of this study are to 1) assess the suitability of approximation tools such as neural networks (NN) and support vector machines (SVM) integrated in a geographic information system (GIS) for identifying contaminated wells and 2) use logistic regression and feature selection methods to identify significant variables for transporting contaminants in and through the soil profile to the groundwater. Fourteen GIS derived soil hydrogeologic and landuse parameters were used as initial inputs in this study. Well water quality data (nitrate-N) from 6,917 wells provided by Florida Department of Environmental Protection (USA) were used as an output target class. The use of the logistic regression and feature selection methods reduced the number of input variables to nine. Receiver operating characteristics (ROC) curves were used for evaluation of these approximation tools. Results showed superior performance with the NN as compared to SVM especially on training data while testing results were comparable. Feature selection did not improve accuracy; however, it helped increase the sensitivity or true positive rate (TPR). Thus, a higher TPR was obtainable with fewer variables. |
| |
Keywords: | |
本文献已被 SpringerLink 等数据库收录! |
|