Please wait a minute...
Frontiers of Environmental Science & Engineering

ISSN 2095-2201

ISSN 2095-221X(Online)

CN 10-1013/X

Postal Subscription Code 80-973

2018 Impact Factor: 3.883

Front. Environ. Sci. Eng.    2023, Vol. 17 Issue (7) : 88    https://doi.org/10.1007/s11783-023-1688-y
RESEARCH ARTICLE
A novel hybrid model for water quality prediction based on VMD and IGOA optimized for LSTM
Zhaocai Wang1, Qingyu Wang1, Tunhua Wu2()
1. College of Information, Shanghai Ocean University, Shanghai 201306, China
2. School of Information Engineering, Wenzhou Business College, Wenzhou 325035, China
 Download: PDF(4295 KB)   HTML
 Export: BibTeX | EndNote | Reference Manager | ProCite | RefWorks
Abstract

● A novel VMD-IGOA-LSTM model has proposed for the prediction of water quality.

● Improved model quickly converges to the global optimal fitness and remains stable.

● The prediction accuracy of water quality parameters is significantly improved.

Water quality prediction is vital for solving water pollution and protecting the water environment. In terms of the characteristics of nonlinearity, instability, and randomness of water quality parameters, a short-term water quality prediction model was proposed based on variational mode decomposition (VMD) and improved grasshopper optimization algorithm (IGOA), so as to optimize long short-term memory neural network (LSTM). First, VMD was adopted to decompose the water quality data into a series of relatively stable components, with the aim to reduce the instability of the original data and increase the predictability, then each component was input into the IGOA-LSTM model for prediction. Finally, each component was added to obtain the predicted values. In this study, the monitoring data from Dayangzhou Station and Shengmi Station of the Ganjiang River was used for training and prediction. The experimental results showed that the prediction accuracy of the VMD-IGOA-LSTM model proposed was higher than that of the integrated model of Ensemble Empirical Mode Decomposition (EEMD), the integrated model of Complete Ensemble Empirical Mode Decomposition with Adaptive Noise (CEEMDAN), Nonlinear Autoregressive Network with Exogenous Inputs (NARX), Recurrent Neural Network (RNN), as well as other models, showing better performance in short-term prediction. The current study will provide a reliable solution for water quality prediction studies in other areas.

Keywords Water quality prediction      Grasshopper optimization algorithm      Variational mode decomposition      Long short-term memory neural network     
Corresponding Author(s): Tunhua Wu   
Issue Date: 13 February 2023
 Cite this article:   
Zhaocai Wang,Qingyu Wang,Tunhua Wu. A novel hybrid model for water quality prediction based on VMD and IGOA optimized for LSTM[J]. Front. Environ. Sci. Eng., 2023, 17(7): 88.
 URL:  
https://academic.hep.com.cn/fese/EN/10.1007/s11783-023-1688-y
https://academic.hep.com.cn/fese/EN/Y2023/V17/I7/88
Fig.1  LSTM network structure.
Fig.2  Reconstructed linear reduction factor c.
Fig.3  Convergence curves of two algorithms on different benchmark functions.
Fig.4  VMD-IGOA-LSTM model steps.
Fig.5  DO datasets of Dayangzhou Station.
Fig.6  Number of modes.
Fig.7  VMD decomposition results.
Fig.8  Observed and predicted values of each component.
Fig.9  Observed and predicted values of DO.
Fig.10  Predicted values of DO in each model for the Dayangzhou Station.
ModelMSERMSEMAEMAPEr2
VMD-IGOA-LSTM*0.00320.05650.04360.4%0.9877
EEMD-IGOA-LSTM0.02850.16910.10310.96%0.7780
CEEMDAN-IGOA-LSTM0.03540.18810.11991.12%0.8667
IGOA-LSTM0.08290.28780.15651.47%0.6450
LSTM0.08990.29990.17541.64%0.5955
NARX0.11470.33880.22452.06%0.5236
RNN0.12970.36010.25772.36%0.3361
BP0.11590.34040.23782.15%0.3898
Tab.1  Prediction error value of each model for the Dayangzhou Station
Fig.11  Error histogram of the prediction of each model for the Dayangzhou Station.
ModelMSERMSEMAEMAPEr2
VMD-IGOA-LSTM0.00440.06610.04950.38%0.9967
EEMD-IGOA-LSTM0.04330.20810.15671.19%0.9661
CEEMDAN-IGOA-LSTM0.04990.22350.17581.33%0.9608
IGOA-LSTM0.23770.48750.34832.58%0.8436
LSTM0.26310.51300.37152.75%0.7733
NARX0.34030.58330.45783.45%0.7134
RNN0.37900.61560.38642.88%0.6629
BP0.43560.65990.48040.03590.6433
Tab.2  Error values of each model for the Shengmi Station
Fig.12  Predicted values of DO in each model for the Shengmi Station.
Fig.13  Error histogram of the prediction of each model for the Shengmi Station.
Reference modelTested model
VMD-IGOA-LSTMEEMD-IGOA-LSTMCEEMDAN-IGOA-LSTM
EEMD-IGOA-LSTM?2.62(**)
CEEMDAN-IGOA-LSTM?2.31(**)?0.44
IGOA-LSTM?1.82(**)?1.24?1.44
LSTM?2.08(**)?1.50?1.75
NARX?4.54(**)?3.48(**)?4.37(**)
RNN?4.64(**)?4.00(**)?3.89(**)
BP?4.43(**)?3.17(**)?2.91(**)
Tab.3  DM test results of each model in Dayangzhou Station
Reference modelTested model
VMD-IGOA-LSTMEEMD-IGOA-LSTMCEEMDAN-IGOA-LSTM
EEMD-IGOA-LSTM?3.82(**)
CEEMDAN-IGOA-LSTM?4.30(**)?0.68
IGOA-LSTM?4.39(**)?3.63(**)?3.70(**)
LSTM?4.11(**)?3.49(**)?3.53(**)
NARX?4.86(**)?4.49 (**)?4.28(**)
RNN?3.50 (**)?3.26(**)?3.24(**)
BP?4.87(**)?4.44(**)?4.47 (**)
Tab.4  DM test results of each model in Shengmi Station
Fig.14  Correlation of various factors.
ModelMSERMSEMAEMAPEr2
VMD-IGOA-LSTM0.08670.29440.17541.59%0.5827
EEMD-IGOA-LSTM0.15000.38730.27052.44%0.2556
CEEMDAN-IGOA-LSTM0.13770.37110.25772.33%0.2544
LSTM0.11950.34580.22002.00%0.3684
NARX0.23820.48810.39113.54%0.1133
RNN0.13150.36270.23122.01%0.3318
Tab.5  Error values of each multi-factor model for the Dayangzhou Station
Fig.15  Predicted values of DO in multi-factor models in Dayangzhou Station.
Fig.16  Error histogram of the prediction in multi-factor models in the Dayangzhou Station.
Fig.17  Multi-step prediction results for each model.
1 A N Ahmed , F B Othman , H A Afan , R K Ibrahim , A Elshafie , M C Fai , M S Hossain , M Ehteram , A Elshafie . (2019). Machine learning methods for better water quality prediction. Journal of Hydrology (Amsterdam), 578: 124084
https://doi.org/10.1016/j.jhydrol.2019.124084
2 R Babbar , I Chaubey . (2021). Multiple regression analysis for predicting few water quality parameters at unmonitored sub-watershed outlets in the St. Joseph River basin, USA. Geocarto International, (11): 1–27
https://doi.org/10.1080/10106049.2021.2005156
3 J Bai , J Zhao , Z Y Zhang , Z Q Tian . (2022). Assessment and a review of research on surface water quality modeling. Ecological Modelling, 466: 109888
https://doi.org/10.1016/j.ecolmodel.2022.109888
4 J Bi , Y Z Lin , Q X Dong , H T Yuan , M C Zhou . (2021). Large-scale water quality prediction with integrated deep neural network. Information Sciences, 571: 191–205
https://doi.org/10.1016/j.ins.2021.04.057
5 L C BrownT O Barnwell (1987). The enhanced stream water quality models qual2e and qual2e-uncas: documentation and user manual. Washington DC: Environmental Research Laboratory Office of Research and Development U.S. Environment Protection Agency
6 C M S Burigato Costa , L da Silva Marques , A K Almeida , I R Leite , I K de Almeida . (2019). Applicability of water quality models around the world: a review. Environmental Science and Pollution Research, 26(36): 36141–36162
https://doi.org/10.1007/s11356-019-06637-2
7 Y Chen , R Zou , S Han , S Bai , M Faizullabhoy , Y Wu , H Guo . (2017). Development of an integrated water quality and macroalgae simulation model for Tidal Marsh eutrophication control decision support. Water (Basel), 9(4): 277
https://doi.org/10.3390/w9040277
8 Y Y ChuehC FanY Z Huang (2020). Copper concentration simulation in a river by swat-wasp integration and its application to assessing the impacts of climate change and various remediation strategies. Journal of Environmental Management, 279(2–4): 111613
9 W Deng , J Xu , X Z Gao , H Zhao . (2022a). An enhanced MSIQDE algorithm with novel multiple strategies for global optimization problems. IEEE Transactions on Systems, Man, and Cybernetics. Systems, 52(3): 1578–1587
https://doi.org/10.1109/TSMC.2020.3030792
10 W Deng , J Xu , H Zhao , Y Song . (2022b). A novel gate resource allocation method using improved PSO-based QEA. IEEE Transactions on Intelligent Transportation Systems, 23(3): 1737–1745
https://doi.org/10.1109/TITS.2020.3025796
11 Y Deng , X Zhou , J Shen , G Xiao , H Hong , H Lin , F Wu , B Q Liao . (2021). New methods based on backpropagation (BP) and radial basis function (RBF) artificial neural networks (ANNS) for predicting the occurrence of haloketones in tap water. Science of the Total Environment, 772(6): 145534
https://doi.org/10.1016/j.scitotenv.2021.145534
12 F Diebold , R Mariano . (1995). Comparing predictive accuracy. Journal of Business & Economic Statistics, 13(3): 253–263
13 Y R Ding , Y J Cai , P D Sun , B Chen . (2014). The use of combined neural networks and genetic algorithms for prediction of river water quality. Journal of Applied Research and Technology, 12(3): 493–499
https://doi.org/10.1016/S1665-6423(14)71629-3
14 K Dragomiretskiy , D Zosso . (2014). Variational mode decomposition. IEEE Transactions on Signal Processing, 62(3): 531–544
https://doi.org/10.1109/TSP.2013.2288675
15 S H Ewaid , S A Abed , S A Kadhum . (2018). Predicting the Tigris River water quality within Baghdad, Iraq by using water quality index and regression analysis. Environmental Technology & Innovation, 11: 390–398
https://doi.org/10.1016/j.eti.2018.06.013
16 X B Feng , J Zhong , R Yan , Z H Zhou , L Tian , J Zhao , Z Y Yuan . (2022). Groundwater radon precursor anomalies identification by EMD-LSTM model. Water (Basel), 14(1): 69
https://doi.org/10.3390/w14010069
17 D P HamiltonS G Schladow (1997). Prediction of water quality in lakes and reservoirs. Part I, Model description. Ecological Modelling, 96(1–3): 1–3
18 K Y Han , S H Kim , D H Bae . (2001). Stochastic water quality analysis using reliability method. Journal of the American Water Resources Association, 37(3): 695–708
https://doi.org/10.1111/j.1752-1688.2001.tb05504.x
19 M He , S F Wu , B B Huang , C X Kang , F L Gui . (2022). Prediction of total nitrogen and phosphorus in surface water by deep learning method based on multi-scale feature extraction. Water (Basel), 14(10): 1643
https://doi.org/10.3390/w14101643
20 S Hochreiter , J Schmidhuber . (1997). Long short-term memory. Neural Computation, 9(8): 1735–1780
https://doi.org/10.1162/neco.1997.9.8.1735
21 N E Huang , Z Shen , S R Long , M C Wu , H H Shih , Q Zheng , N C Yen , C C Tung , H H Liu . (1998). The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series data analysis. Proceedings Mathematical Physical & Engineering Sciences, 454(1971): 903–995
22 Y Huang , J Chen , Q Duan , Y Feng , R Luo , W Wang , F Liu , S Bi , J Lee . (2022). A fast antibiotic detection method for simplified pretreatment through spectra-based machine learning. Frontiers of Environmental Science & Engineering, 16(3): 38
https://doi.org/10.1007/s11783-021-1472-9
23 Z Ji , Z Wang , X Deng , W Huang , T Wu . (2019). A new parallel algorithm to solve one classic water resources optimal allocation problem based on inspired computational model. Desalination and Water Treatment, 160: 214–218
https://doi.org/10.5004/dwt.2019.24386
24 Y Jiang . (2015). China’s water security: current status, emerging challenges and future prospect. Environmental Science and Pollution Research International, 54: 106–125
25 T Jin , S Cai , D Jiang , J Liu . (2019). A data-driven model for real-time water quality prediction and early warning by an intergration method. Environmental Science and Pollution Research International, 26(29): 30374–30385
https://doi.org/10.1007/s11356-019-06049-2
26 J Kim , T Lee , D Seo . (2017). Algal bloom prediction of the lower Han River, Korea using the EFDC hydrodynamic and water quality model. Ecological Modelling, 366: 27–36
https://doi.org/10.1016/j.ecolmodel.2017.10.015
27 Y X Lan . (2021). Grasshopper optimization algorithm based on chaos and cauchy mutation and feature selection. Microelectronics & Computer, 38(11): 21–30
28 W C Leong , A Bahadori , J Zhang , Z Ahmad . (2021). Prediction of water quality index (WQI) using support vector machine (SVM) and least square-support vector machine (LS-SVM). International Journal of River Basin Management, 19(2): 149–156
https://doi.org/10.1080/15715124.2019.1628030
29 X Li , J Sha , Z L Wang . (2017). A comparative study of multiple linear regression, artificial neural network and support vector machine for the prediction of dissolved oxygen. Nordic Hydrology, 48(5): 1214–1225
https://doi.org/10.2166/nh.2016.149
30 Z Li , F Peng , B Niu , G Li , J Wu , Z Miao . (2018). Water quality prediction model combining sparse auto-encoder and LSTM network. IFAC-PapersOnLine, 51(17): 831–836
https://doi.org/10.1016/j.ifacol.2018.08.091
31 Y Liu , Q Zhang , L Song , Y Chen . (2019). Attention-based recurrent neural networks for accurate short-term and long-term dissolved oxygen prediction. Computers and Electronics in Agriculture, 165: 104964
https://doi.org/10.1016/j.compag.2019.104964
32 H Mohammed , H M Tornyeviadzi , R Seidu . (2021). Modelling the impact of weather parameters on the microbial quality of water in distribution systems. Journal of Environmental Management, 284(1): 111997
https://doi.org/10.1016/j.jenvman.2021.111997
33 M Najafzadeh , S Niazmardi . (2021). A novel multiple-kernel support vector regression algorithm for estimation of water quality parameters. Natural Resources Research, 30(5): 3761–3775
https://doi.org/10.1007/s11053-021-09895-5
34 T Oki , S Kanae . (2006). Global hydrological cycles and world water resources. Science, 313(5790): 1068–1072
https://doi.org/10.1126/science.1128845
35 T Rajaee , S Khani , M Ravansalar . (2020). Artificial intelligence-based single and hybrid models for prediction of water quality in rivers: a review. Chemometrics and Intelligent Laboratory Systems, 200: 103978
https://doi.org/10.1016/j.chemolab.2020.103978
36 S Saremi , S Mirjalili , A Lewis . (2017). Grasshopper optimization algorithm: theory and application. Advances in Engineering Software, 105: 30–47
https://doi.org/10.1016/j.advengsoft.2017.01.004
37 D Seo , M Kim , J H Ahn . (2012). Prediction of chlorophyll-a changes due to weir constructions in the Nakdong River using EFDC-WASP modelling. Environmental Engineering Research, 17(2): 95–102
https://doi.org/10.4491/eer.2012.17.2.095
38 D M D Toro , J J Fitzpatrick , R V Thomann . (1983). Documentation for water quality analysis simulation program (WASP) and model verification program (MVP). Proceedings of the Society for Photo-Instrumentation Engineers, 34(5): 4–10
39 B Vaheddoost , H Aksoy . (2021). Regressive-stochastic models for predicting water level in Lake Urmia. Hydrological Sciences Journal, 66(13): 1892–1906
https://doi.org/10.1080/02626667.2021.1974447
40 C J Vörösmarty , P B McIntyre , M O Gessner , D Dudgeon , A Prusevich , P Green , S Glidden , S E Bunn , C A Sullivan , C R Liermann , P M Davies . (2010). Global threats to human water security and river biodiversity. Nature, 467(7315): 555–561
https://doi.org/10.1038/nature09440
41 Z Wang , A Deng , D Wang , T Wu . (2022). A parallel algorithm to solve the multiple travelling salesmen problem based on molecular computing model. International Journal of Bio-Inspired Computation, 20(3): 160–171
https://doi.org/10.1504/ijbic.2022.127504
42 Z Wang , X Wu , H Wang , T Wu . (2021). Prediction and analysis of domestic water consumption based on optimized grey and Markov model. Water Science and Technology: Water Supply, 21(7): 3887–3899
https://doi.org/10.2166/ws.2021.146
43 J Wu , Z Li , L Zhu , G Li , B Niu , F Peng . (2018). Optimized bp neural network for dissolved oxygen prediction. IFAC-PapersOnLine, 51(17): 596–601
https://doi.org/10.1016/j.ifacol.2018.08.132
44 J Wu , Z Wang . (2022). A hybrid model for water quality prediction based on an artificial neural network, wavelet transform, and long short-term memory. Water (Basel), 14(4): 610
https://doi.org/10.3390/w14040610
45 J Xu , M Xu , Y X Zhao , S F Wang , M H Tao , Y G Wang . (2021). Spatial-temporal distribution and evolutionary characteristics of water environment sudden pollution incidents in China from 2006 to 2018. Science of the Total Environment, 801: 149677
https://doi.org/10.1016/j.scitotenv.2021.149677
46 L Xu , J Shen , D Marinova , X Guo , F Sun , F Zhu . (2013). Changes of public environmental awareness in response to the Taihu blue-green algae bloom incident in China. Environment, Development and Sustainability, 15(5): 1281–1302
https://doi.org/10.1007/s10668-013-9440-6
47 R Yao , C Guo , W Deng , H Zhao . (2022). A novel mathematical morphology spectrum entropy based on scale-adaptive techniques. ISA Transactions, 126: 691–702
https://doi.org/10.1016/j.isatra.2021.07.017
48 R L Yu , C Zhang . (2021). Early warning of water quality degradation: a copula-based Bayesian network model for highly efficient water quality risk assessment. Journal of Environmental Management, 292: 112749
https://doi.org/10.1016/j.jenvman.2021.112749
49 J Zhang , Y Zhu , X Zhang , M Ye , J Yang . (2018). Developing a long short-term memory (LSTM) based model for predicting water table depth in agricultural areas. Journal of Hydrology (Amsterdam), 561: 918–929
https://doi.org/10.1016/j.jhydrol.2018.04.065
50 Z Zhou , C Lin , S Li , S Liu , F Li , B Yuan . (2022). Four kinds of capping materials for controlling phosphorus and nitrogen release from contaminated sediment using a static simulation experiment. Frontiers of Environmental Science & Engineering, 16(3): 29
https://doi.org/10.1007/s11783-021-1463-x
51 Z Zhu , N Oberg , V M Morales , J C Quijano , B J Landry , M H Garcia . (2016). Integrated urban hydrologic and hydraulic modelling in Chicago, Illinois. Environmental Modelling & Software, 77: 63–70
https://doi.org/10.1016/j.envsoft.2015.11.014
[1] Junlang Li, Zhenguo Chen, Xiaoyong Li, Xiaohui Yi, Yingzhong Zhao, Xinzhong He, Zehua Huang, Mohamed A. Hassaan, Ahmed El Nemr, Mingzhi Huang. Water quality soft-sensor prediction in anaerobic process using deep neural network optimized by Tree-structured Parzen Estimator[J]. Front. Environ. Sci. Eng., 2023, 17(6): 67-.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed