|
|
Utilizing machine learning models to grasp water quality dynamic changes in lake eutrophication through phytoplankton parameters |
Yong Fang, Ruting Huang( ), Yeyin Zhang, Jun Zhang, Wenni Xi, Xianyang Shi( ) |
Anhui Province Key Laboratory of Wetland Ecosystem Protection and Restoration, School of Resources and Environmental Engineering, Anhui University, Hefei 230601, China |
|
|
Abstract Phytoplankton serve as vital indicators of eutrophication levels. However, relying solely on phytoplankton parameters, such as chlorophyll-a, limits our comprehensive understanding of the intricate eutrophication conditions in natural lakes, particularly in terms of timely analysis of changes in limiting nutrients and their concentrations. This study presents machine learning (ML) models for predicting and identifying lake eutrophication. Five tree-based ML models were developed using the latest data on hydrological, water quality, and meteorological parameters obtained from 34 sites in the Huating Lake basin over 5 months. The extreme gradient boosting model exhibited high accuracy in predicting the total nitrogen/total phosphorus ratio (TN/TP) (R2 = 0.88; RMSE = 24.60; MAPE = 26.14%). Analysis of the TN/TP ratio and output eigenvalue weight revealed that phosphorus plays a crucial role in eutrophication, probably because of the low-flow and deep-water characteristics of the basin. Furthermore, the light gradient boosting machine model exhibited outstanding performance and high accuracy in predicting phytoplankton parameters, especially the Shannon index (H′) (R2 = 0.92; RMSE = 0.11; MAPE = 4.95%). The mesotrophic classification of the Huating Lake determined using the H′ threshold, coincided with the findings from the H′ analysis. Future research should cover a wider range of pollution sources and spatiotemporal dimensions to further validate our findings. Overall, this study highlights the potential of incorporating the TN/TP ratio and phytoplankton parameters into ML techniques for effective monitoring and management of environmental conditions.
|
Keywords
Machine learning
Lake
Phytoplankton
Water quality
|
Corresponding Author(s):
Ruting Huang,Xianyang Shi
|
Issue Date: 21 November 2024
|
|
1 |
M J Behrenfeld, E S Boss, K H Halsey. (2021). Phytoplankton community structuring and succession in a competition-neutral resource landscape. ISME Communications, 1(1): 12
https://doi.org/10.1038/s43705-021-00011-5
|
2 |
K P Brown, A Gerber, D Bedulina, M A Timofeyev. (2021). Human impact and ecosystemic health at Lake Baikal. WIREs. Water, 8(4): e1528
https://doi.org/10.1002/wat2.1528
|
3 |
S M Burdick, D A Hewitt, B A Martin, L Schenk, S A Rounds. (2020). Effects of harmful algal blooms and associated water-quality on endangered Lost River and shortnose suckers. Harmful Algae, 97: 101847
https://doi.org/10.1016/j.hal.2020.101847
|
4 |
J C Carrasco Navas-Parejo, A Corzo, S Papaspyrou. (2020). Seasonal cycles of phytoplankton biomass and primary production in a tropical temporarily open-closed estuarine lagoon: the effect of an extreme climatic event. Science of the Total Environment, 723: 138014
https://doi.org/10.1016/j.scitotenv.2020.138014
|
5 |
Y Chi, D Liu, W Xing, J Wang. (2021). Island ecosystem health in the context of human activities with different types and intensities. Journal of Cleaner Production, 281: 125334
https://doi.org/10.1016/j.jclepro.2020.125334
|
6 |
D J Conley, H W Paerl, R W Howarth, D F Boesch, S P Seitzinger, K E Havens, C Lancelot, G E Likens. (2009). Controlling eutrophication: nitrogen and phosphorus. Science, 323(5917): 1014–1015
https://doi.org/10.1126/science.1167755
|
7 |
J Derot, A Jamoneau, N Teichert, J Rosebery, S Morin, C Laplace-Treyture. (2020). Response of phytoplankton traits to environmental variables in French lakes: new perspectives for bioindication. Ecological Indicators, 108: 105659
https://doi.org/10.1016/j.ecolind.2019.105659
|
8 |
S Dhaliwal, A Nahid, R Abbas. (2018). Effective intrusion detection system using XGBoost. Information, 9(7): 149
https://doi.org/10.3390/info9070149
|
9 |
F Ding, W Zhang, S Cao, S Hao, L Chen, X Xie, W Li, M Jiang. (2023). Optimization of water quality index models using machine learning approaches. Water Research, 243: 120337
https://doi.org/10.1016/j.watres.2023.120337
|
10 |
X Dong, S Zeng, F Bai, D Li, M He. (2016). Extracellular microcystin prediction based on toxigenic Microcystis detection in a eutrophic lake. Scientific Reports, 6(1): 20886
https://doi.org/10.1038/srep20886
|
11 |
C Feng (2007). Studies on the agricultural ecological tour development in Huating Lake scenic spot. Anhui Nongye Kexue, 35(7): 2035–2037 (in Chinese)
|
12 |
L Feng, Y Dai, X Hou, Y Xu, J Liu, C Zheng. (2021). Concerns about phytoplankton bloom trends in global lakes. Nature, 590(7846): E35–E47
https://doi.org/10.1038/s41586-021-03254-3
|
13 |
A C C Fortes, P R G Barrocas, D C Kligerman. (2023). Water quality indices: construction, potential, and limitations. Ecological Indicators, 157: 111187
https://doi.org/10.1016/j.ecolind.2023.111187
|
14 |
A D L Fuente, A M Muro-Pastor, F Merchán, F Madrid, J I Pérez-Martínez, T Undabeytia. (2019). Electrocoagulation/flocculation of cyanobacteria from surface waters. Journal of Cleaner Production, 238: 117964
https://doi.org/10.1016/j.jclepro.2019.117964
|
15 |
F Ge, Z Ma, B Chen, Y Wang, X Lu, S An, D Zhang, W Zhang, W Yu, W Han. et al.. (2022). Phytoplankton species diversity patterns and associated driving factors in China’s Jiulong River estuary: roles that nutrients and nutrient ratios play. Frontiers in Marine Science, 9: 829285
https://doi.org/10.3389/fmars.2022.829285
|
16 |
P L Georgescu, S Moldovanu, C Iticescu, M Calmuc, V Calmuc, C Topa, L Moraru. (2023). Assessing and forecasting water quality in the Danube River by using neural network approaches. Science of the Total Environment, 879: 162998
https://doi.org/10.1016/j.scitotenv.2023.162998
|
17 |
J Horppila. (2019). Sediment nutrients, ecological status and restoration of lakes. Water Research, 160: 206–208
https://doi.org/10.1016/j.watres.2019.05.074
|
18 |
R W Howarth, R Marino (2006). Nitrogen as the limiting nutrient for eutrophication in coastal marine ecosystems: Evolving views over three decades. Limnology and Oceanography, 51(1111): 364–376
|
19 |
L Hu, K Shan, L Huang, Y Li, L Zhao, Q Zhou, L Song. (2021). Environmental factors associated with cyanobacterial assemblages in a mesotrophic subtropical plateau lake: a focus on bloom toxicity. Science of the Total Environment, 777: 146052
https://doi.org/10.1016/j.scitotenv.2021.146052
|
20 |
Y Hu, W Du, C Yang, Y Wang, T Huang, X Xu, W Li. (2023). Source identification and prediction of nitrogen and phosphorus pollution of Lake Taihu by an ensemble machine learning technique. Frontiers of Environmental Science & Engineering, 17(5): 55
https://doi.org/10.1007/s11783-023-1655-7
|
21 |
L Hua, W Li, L Zhai, H Yen, Q Lei, H Liu, T Ren, Y Xia, F Zhang, X Fan. (2019). An innovative approach to identifying agricultural pollution sources and loads by using nutrient export coefficients in watershed modeling. Journal of Hydrology, 571: 322–331
https://doi.org/10.1016/j.jhydrol.2019.01.043
|
22 |
S H Jenkins. (1982). Standard methods for the examination of water and wastewater. Water Research, 16(10): 1495–1496
https://doi.org/10.1016/0043-1354(82)90249-4
|
23 |
J Jia, Y Gao, X Song, S Chen. (2019). Characteristics of phytoplankton community and water net primary productivity response to the nutrient status of the Poyang Lake and Gan River, China. Ecohydrology, 12(7): e2136
https://doi.org/10.1002/eco.2136
|
24 |
M Jiang, S I Nakano. (2022). The crucial influence of trophic status on the relative requirement of nitrogen to phosphorus for phytoplankton growth. Water Research, 222: 118868
https://doi.org/10.1016/j.watres.2022.118868
|
25 |
M Jin, Z Ren, J P Shi, X Z Huang, J R Chen. (2010). Impact of agricultural non-point source pollution in eutrophic water body of Taihu Lake. Environmental Science & Technology, 33(10): 106–111
|
26 |
K Kim. (2016). A hybrid classification algorithm by subspace partitioning through semi-supervised decision tree. Pattern Recognition, 60: 157–163
https://doi.org/10.1016/j.patcog.2016.04.016
|
27 |
N Li, J Wang, W Yin, H Jia, J Xu, R Hao, Z Zhong, Z Shi. (2021). Linking water environmental factors and the local watershed landscape to the chlorophyll a concentration in reservoir bays. Science of the Total Environment, 758: 143617
https://doi.org/10.1016/j.scitotenv.2020.143617
|
28 |
S Li, C Liu, P Sun, T Ni. (2022). Response of cyanobacterial bloom risk to nitrogen and phosphorus concentrations in large shallow lakes determined through geographical detector: a case study of Taihu Lake, China. Science of the Total Environment, 816: 151617
https://doi.org/10.1016/j.scitotenv.2021.151617
|
29 |
X Li, W Xu, S Song, J Sun. (2023). Sources and spatiotemporal distribution characteristics of nitrogen and phosphorus loads in the Haihe River Basin, China. Marine Pollution Bulletin, 189: 114756
https://doi.org/10.1016/j.marpolbul.2023.114756
|
30 |
E Litchman, C A Klausmeier. (2008). Trait-based community ecology of phytoplankton. Annual Review of Ecology, Evolution, and Systematics, 39(1): 615–639
https://doi.org/10.1146/annurev.ecolsys.39.110707.173549
|
31 |
Y Liu, H Luo, B Zhao, X Zhao, Z Han (2018). Short-Term Power Load Forecasting Based on Clustering and XGBoost Method. New York: Institute of Electrical and Electronics Engineers
|
32 |
Y Liu, Y Zhuang, B Ji, G Zhang, L Rong, G Teng, C Wang. (2022). Prediction of laying hen house odor concentrations using machine learning models based on small sample data. Computers and Electronics in Agriculture, 195: 106849
https://doi.org/10.1016/j.compag.2022.106849
|
33 |
F Meng, Z Li, L Li, F Lu, Y Liu, X Lu, Y Fan. (2020). Phytoplankton alpha diversity indices response the trophic state variation in hydrologically connected aquatic habitats in the Harbin Section of the Songhua River. Scientific Reports, 10(1): 21337
https://doi.org/10.1038/s41598-020-78300-7
|
34 |
P Muhid, T W Davis, S E Bunn, M A Burford. (2013). Effects of inorganic nutrients in recycled water on freshwater phytoplankton biomass and composition. Water Research, 47(1): 384–394
https://doi.org/10.1016/j.watres.2012.10.015
|
35 |
B Qin, J Zhou, J J Elser, W S Gardner, J Deng, J D Brookes. (2020). Water depth underpins the relative roles and fates of nitrogen and phosphorus in lakes. Environmental Science & Technology, 54(6): 3191–3198
https://doi.org/10.1021/acs.est.9b05858
|
36 |
K Rao, X Zhang, X Yi, Z Li, P Wang, G Huang, X Guo (2018). Interactive effects of environmental factors on phytoplankton communities and benthic nutrient interactions in a shallow lake and adjoining rivers in China. Science of the Total Environment, 619–620: 1661–1672
|
37 |
G T Reddy, M P K Reddy, K Lakshmanna, R Kaluri, D S Rajput, G Srivastava, T Baker. (2020). Analysis of dimensionality reduction techniques on big data. IEEE Access: Practical Innovations, Open Solutions, 8: 54776–54788
https://doi.org/10.1109/ACCESS.2020.2980942
|
38 |
M Rezaie-Balf, N F Attar, A Mohammadzadeh, M A Murti, A N Ahmed, C M Fai, N Nabipour, S Alaghmand, A El-Shafie. (2020). Physicochemical parameters data assimilation for efficient improvement of water quality index prediction: comparative assessment of a noise suppression hybridization approach. Journal of Cleaner Production, 271: 122576
https://doi.org/10.1016/j.jclepro.2020.122576
|
39 |
K Shan, L Song, W Chen, L Li, L Liu, Y Wu, Y Jia, Q Zhou, L Peng. (2019). Analysis of environmental drivers influencing interspecific variations and associations among bloom-forming cyanobacteria in large, shallow eutrophic lakes. Harmful Algae, 84: 84–94
https://doi.org/10.1016/j.hal.2019.02.002
|
40 |
K P Singh, A Malik, S Sinha. (2005). Water quality assessment and apportionment of pollution sources of Gomti River (India) using multivariate statistical techniques: a case study. Analytica Chimica Acta, 538(1−2): 355–374
https://doi.org/10.1016/j.aca.2005.02.006
|
41 |
Y Tian, Y Jiang, Q Liu, D Xu, Y Liu, J Song. (2021). The impacts of local and regional factors on the phytoplankton community dynamics in a temperate river, northern China. Ecological Indicators, 123: 107352
https://doi.org/10.1016/j.ecolind.2021.107352
|
42 |
M G Uddin, S Nash, M T Mahammad Diganta, A Rahman, A I Olbert. (2022a). Robust machine learning algorithms for predicting coastal water quality index. Journal of Environmental Management, 321(8): 115923
https://doi.org/10.1016/j.jenvman.2022.115923
|
43 |
M G Uddin, S Nash, A Rahman, T Dabrowski, A I Olbert. (2024a). Data-driven modelling for assessing trophic status in marine ecosystems using machine learning approaches. Environmental Research, 242: 117755
https://doi.org/10.1016/j.envres.2023.117755
|
44 |
M G Uddin, S Nash, A Rahman, A I Olbert. (2022b). A comprehensive method for improvement of water quality index (WQI) models for coastal water quality assessment. Water Research, 219: 118532
https://doi.org/10.1016/j.watres.2022.118532
|
45 |
M G Uddin, S Nash, A Rahman, A I Olbert. (2023a). A novel approach for estimating and predicting uncertainty in water quality index model using machine learning approaches. Water Research, 229: 119422
https://doi.org/10.1016/j.watres.2022.119422
|
46 |
M G Uddin, S Nash, A Rahman, A I Olbert. (2023b). A sophisticated model for rating water quality. Science of the Total Environment, 868: 161614
https://doi.org/10.1016/j.scitotenv.2023.161614
|
47 |
M G Uddin, A Rahman, F Rosa Taghikhah, A I Olbert. (2024b). Data-driven evolution of water quality models: an in-depth investigation of innovative outlier detection approaches-A case study of Irish Water Quality Index (IEWQI) model. Water Research, 255: 121499
https://doi.org/10.1016/j.watres.2024.121499
|
48 |
X Wang, D Fu, Y Wang, Y Guo, Y Ding. (2021). The XGBoost and the SVM-based prediction models for bioretention cell decontamination effect. Arabian Journal of Geosciences, 14(8): 669
https://doi.org/10.1007/s12517-021-07013-6
|
49 |
Z Wu, Y Liu, Z Liang, S Wu, H Guo. (2017). Internal cycling, not external loading, decides the nutrient limitation in eutrophic lake: a dynamic model with temporal Bayesian hierarchical inference. Water Research, 116: 231–240
https://doi.org/10.1016/j.watres.2017.03.039
|
50 |
J Xiong, C Lin, Z Cao, M Hu, K Xue, X Chen, R Ma (2022). Development of remote sensing algorithm for total phosphorus concentration in eutrophic lakes: conventional or machine learning? Water Research, 215(1): 118213
|
51 |
J Xiong, C Lin, R Ma, Z Cao. (2019). Remote sensing estimation of lake total phosphorus concentration based on MODIS: a case study of Lake Hongze. Remote Sensing, 11(17): 2068
https://doi.org/10.3390/rs11172068
|
52 |
W Xu, X Li, Y Li, Y Sun, L Zhang, Y Huang, Z Yang. (2021). Rising temperature more strongly promotes low-abundance Paramecium to remove Microcystis and degrade Microcystins. Environmental Pollution, 291: 118143
https://doi.org/10.1016/j.envpol.2021.118143
|
53 |
W Xu, X Su. (2019). Challenges and impacts of climate change and human activities on groundwater-dependent ecosystems in arid areas: a case study of the Nalenggele alluvial fan in NW China. Journal of Hydrology, 573: 376–385
https://doi.org/10.1016/j.jhydrol.2019.03.082
|
54 |
Y Yang, B Gao, H Hao, H Zhou, J Lu. (2017). Nitrogen and phosphorus in sediments in China: a national-scale assessment and review. Science of the Total Environment, 576: 840–849
https://doi.org/10.1016/j.scitotenv.2016.10.136
|
55 |
R Ye, K Shan, H Gao, R Zhang, W Xiong, Y Wang, X Qian. (2014). Spatio-temporal distribution patterns in environmental factors, chlorophyll-a and microcystins in a large shallow lake, Lake Taihu, China. International Journal of Environmental Research and Public Health, 11(5): 5155–5169
https://doi.org/10.3390/ijerph110505155
|
56 |
H Yu, S Jiang, K C Land. (2015). Multicollinearity in hierarchical linear models. Social Science Research, 53: 118–136
https://doi.org/10.1016/j.ssresearch.2015.04.008
|
57 |
Q Yu, F Wang, W Yan, F Zhang, S Lv, Y Li. (2018). Carbon and nitrogen burial and response to climate change and anthropogenic disturbance in Chaohu Lake, China. International Journal of Environmental Research and Public Health, 15(12): 2734
https://doi.org/10.3390/ijerph15122734
|
58 |
L L Yuan, A I Pollard. (2017). Using national-scale data to develop nutrient–microcystin relationships that guide management decisions. Environmental Science & Technology, 51(12): 6972–6980
https://doi.org/10.1021/acs.est.7b01410
|
59 |
F Zhang, B Xue, Y Cai, H Xu, W Zou. (2023). Utility of trophic state index in lakes and reservoirs in the Chinese eastern plains ecoregion: the key role of water depth. Ecological Indicators, 148: 110029
https://doi.org/10.1016/j.ecolind.2023.110029
|
60 |
J Zhang, P Fu, F Meng, X Yang, J Xu, Y Cui. (2022). Estimation algorithm for chlorophyll-a concentrations in water from hyperspectral images based on feature derivation and ensemble learning. Ecological Informatics, 71: 101783
https://doi.org/10.1016/j.ecoinf.2022.101783
|
61 |
M Zhang, N Leyi, T Cao, T Fang, D W Xiong, G J Zhou, G R Zhu, X U Jun, L G Guo. (2010). Impact of aquatic environmental factors on distribution pattern of aquatic macrophytes in upper reaches of Taihu Lake watershed. Environmental Science & Technology, 33(3): 171–174
|
62 |
N Zhang, S Zang. (2015). Characteristics of phytoplankton distribution for assessment of water quality in the Zhalong Wetland, China. International Journal of Environmental Science and Technology, 12(11): 3657–3664
https://doi.org/10.1007/s13762-015-0795-0
|
63 |
P Znachor, J Nedoma, J Hejzlar, J Seďa, J Komárková, V Kolář, T Mrkvička, D S Boukal. (2020). Changing environmental conditions underpin long-term patterns of phytoplankton in a freshwater reservoir. Science of the Total Environment, 710: 135626
https://doi.org/10.1016/j.scitotenv.2019.135626
|
|
Viewed |
|
|
|
Full text
|
|
|
|
|
Abstract
|
|
|
|
|
Cited |
|
|
|
|
|
Shared |
|
|
|
|
|
Discussed |
|
|
|
|