Selection of the best fitted Machine Learning Algorithms for Turbidity Prediction using Water Quality Parameters in Non-point Source Pollution Monitoring Network

Abstract

This study aimed to identify the best-fitting machine learning model for predicting turbidity from water temperature, pH, electrical conductivity (EC), and dissolved oxygen (DO) data collected by non-point source pollution monitoring networks, so that turbidity can be estimated when its data are missing. K-NN, SVM, and Decision Tree models were trained for this purpose. To assess each algorithm's sensitivity to the scale of the monitoring data, both raw and normalized data sets were used. Hyperparameters were tuned to find the values that gave each algorithm its best performance, and K-fold cross-validation was employed to limit overfitting. After tuning, the top 10 models with the highest NSE were evaluated on separate test data that was not involved in the tuning process, which allowed further validation of model performance using metrics such as NSE, MSE, RMSE, and MAE. The results indicated that the Decision Tree algorithm achieved the highest prediction accuracy, followed by SVM and K-NN. Decision Tree was particularly well suited to predicting turbidity from the other water quality monitoring data. Thus, machine learning techniques could be used effectively to predict a water quality parameter when its records are partially missing or erroneous.
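The four evaluation metrics named in the abstract can be illustrated with a minimal sketch. It assumes the standard definitions of NSE (Nash-Sutcliffe efficiency), MSE, RMSE, and MAE; the function name `evaluate` and the sample turbidity values are hypothetical and are not taken from the study.

```python
import math


def evaluate(observed, predicted):
    """Return NSE, MSE, RMSE, and MAE for paired observed/predicted values."""
    if len(observed) != len(predicted) or not observed:
        raise ValueError("inputs must be non-empty and of equal length")
    n = len(observed)
    mean_obs = sum(observed) / n
    errors = [o - p for o, p in zip(observed, predicted)]
    # Sum of squared prediction errors.
    sse = sum(e * e for e in errors)
    # Sum of squared deviations of the observations from their mean.
    spread = sum((o - mean_obs) ** 2 for o in observed)
    if spread == 0:
        raise ValueError("NSE is undefined when all observations are equal")
    mse = sse / n
    return {
        "NSE": 1.0 - sse / spread,  # 1.0 means a perfect fit
        "MSE": mse,
        "RMSE": math.sqrt(mse),
        "MAE": sum(abs(e) for e in errors) / n,
    }


# Hypothetical turbidity values, made up for illustration only.
observed = [2.0, 4.0, 6.0, 8.0]
predicted = [2.5, 3.5, 6.5, 7.5]
scores = evaluate(observed, predicted)
print(scores)
```

An NSE of 1 indicates a perfect match, while an NSE of 0 means the model predicts no better than the mean of the observations, which is why ranking tuned models by NSE and then checking the error metrics on held-out data is a reasonable reading of the procedure described above.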

Keywords

Non-point source pollution, Monitoring network, Turbidity, Machine learning, Prediction
Authors
김창균, 현제원
Publication date
2024-12
Type
Y
Journal
Journal of Environmental Analysis, Health and Toxicology
Volume
27
Issue
4
Pages
249 ~ 256