随机森林
富营养化
支持向量机
多层感知器
感知器
决策树
主成分分析
水华
环境科学
机器学习
线性回归
统计
计算机科学
人工智能
人工神经网络
浮游植物
数学
生态学
营养物
生物
作者
Hua‐Dong Huang,Jing Zhang
标识
DOI:10.1016/j.envpol.2024.123501
摘要
Four different methods were used to identify the important factors influencing chlorophyll-a (Chl-a) content: correlation analysis (CC-NMI), principal component analysis (PCA), decision tree (DT), and random forest recursive feature elimination (RF-RFE). Considering the relationship between Chl-a and its active and passive factors, we established machine learning combination models based on multiple linear regression (MLR), multi-layer perceptron (MLP), and support vector regression (SVR) to predict Chl-a content for Poyang Lake, China. Then, the predictive effects of different combination models were compared and evaluated from multiple perspectives. Considering the actual needs for eutrophication prevention and control, the concept of risk probability was then introduced to assess the risk degree of risk associated with water blooms in Poyang Lake. The results indicated that the mean R2 for the Chl-a predictions using the MLR, MLP, and SVR models was 0.21, 0.61, and 0.75, respectively. Consequently, the SVR model demonstrated higher precision and more accurate predictions. Compared to other methods, integrating the SVR model with the RF-RFE method significantly improved the prediction accuracy, with the R2 increasing to 0.94. For Poyang Lake, 8.8% of random samples indicated a low risk level with a water bloom probability of 21.1%-36.5%; one sample indicated a medium risk level with a risk probability of 45.5%. The research results offer valuable insights for predicting eutrophication and conducting risk assessments for Poyang Lake. They also provide reliable scientific support for making decisions about eutrophication in lakes and reservoirs. Therefore, the results hold significant theoretical importance, practical value, and potential for widespread application.
科研通智能强力驱动
Strongly Powered by AbleSci AI