TY - JOUR
T1 - Using machine learning to identify karst sinkholes from LiDAR-derived topographic depressions in the Bluegrass Region of Kentucky
AU - Zhu, Junfeng
AU - Nolte, Adam M.
AU - Jacobs, Nathan
AU - Ye, Ming
N1 - Publisher Copyright:
© 2020 Elsevier B.V.
PY - 2020/9
Y1 - 2020/9
N2 - Information about the distribution and characteristics of existing sinkholes is critical for understanding karst aquifer systems and evaluating sinkhole hazards. LiDAR provides accurate and high-resolution topographic information and has been used to improve delineation of sinkholes in many karst regions. LiDAR data also reveal many topographic depressions, however, and identifying sinkholes from these depressions through manual visual inspection can be slow and laborious. To improve the efficiency of the identification process, we applied six machine learning methods (logistic regression, naive Bayes, neural network, random forests, RUSBoost, and support vector machine) to a dataset of morphometric characteristics of LiDAR-derived topographic depressions. Sinkhole data from Bourbon, Woodford, and Jessamine Counties in the Bluegrass Region of Kentucky were used to derive the dataset for training and testing the machine learning methods. The dataset consisted of 22,884 records with 10 variables for each record. For each method, a random subset of 80% of the records was used for training and the remaining 20% was used for testing. The test receiver operating characteristic curves showed that all six methods were applicable to the dataset, as demonstrated by all area under the curves (AUCs) being greater than 0.87. Neural network emerged as the method that performed best, with an AUC of 0.95 and a testing average accuracy of 0.85. To further improve the sinkhole mapping process, we subsequently developed a two-step process that combined the trained neural network classifier and manual visual inspection and applied the process to Scott County, also in the Bluegrass region. We were able to locate 97% of the sinkholes in the county by manually inspecting only 27% of the topographic depressions the neural network classified as having relatively high probabilities of being sinkholes. This study showed that machine learning is a promising method for improving sinkhole identification efficiency in karst areas in which high-resolution topographic information is available.
AB - Information about the distribution and characteristics of existing sinkholes is critical for understanding karst aquifer systems and evaluating sinkhole hazards. LiDAR provides accurate and high-resolution topographic information and has been used to improve delineation of sinkholes in many karst regions. LiDAR data also reveal many topographic depressions, however, and identifying sinkholes from these depressions through manual visual inspection can be slow and laborious. To improve the efficiency of the identification process, we applied six machine learning methods (logistic regression, naive Bayes, neural network, random forests, RUSBoost, and support vector machine) to a dataset of morphometric characteristics of LiDAR-derived topographic depressions. Sinkhole data from Bourbon, Woodford, and Jessamine Counties in the Bluegrass Region of Kentucky were used to derive the dataset for training and testing the machine learning methods. The dataset consisted of 22,884 records with 10 variables for each record. For each method, a random subset of 80% of the records was used for training and the remaining 20% was used for testing. The test receiver operating characteristic curves showed that all six methods were applicable to the dataset, as demonstrated by all area under the curves (AUCs) being greater than 0.87. Neural network emerged as the method that performed best, with an AUC of 0.95 and a testing average accuracy of 0.85. To further improve the sinkhole mapping process, we subsequently developed a two-step process that combined the trained neural network classifier and manual visual inspection and applied the process to Scott County, also in the Bluegrass region. We were able to locate 97% of the sinkholes in the county by manually inspecting only 27% of the topographic depressions the neural network classified as having relatively high probabilities of being sinkholes. This study showed that machine learning is a promising method for improving sinkhole identification efficiency in karst areas in which high-resolution topographic information is available.
KW - LiDAR
KW - Machine learning
KW - Morphometric characteristic
KW - Sinkhole
KW - Topographic depression
UR - http://www.scopus.com/inward/record.url?scp=85084544073&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85084544073&partnerID=8YFLogxK
U2 - 10.1016/j.jhydrol.2020.125049
DO - 10.1016/j.jhydrol.2020.125049
M3 - Article
AN - SCOPUS:85084544073
SN - 0022-1694
VL - 588
JO - Journal of Hydrology
JF - Journal of Hydrology
M1 - 125049
ER -