Considering wastewater properties from the rapid development of industrial growth, it is necessary to have an optimal treatment system for this wastewater to mitigate the environmental impact of discharges and comply with environmental regulatory stan...
Considering wastewater properties from the rapid development of industrial growth, it is necessary to have an optimal treatment system for this wastewater to mitigate the environmental impact of discharges and comply with environmental regulatory standards. The quality of industrial wastewater effluent varies depending on factors such as wastewater source and treatment process (physical, chemical, biological). Governing equations and data-based prediction models are used to determine treatment facilities' process characteristics and behavior. However, a lot of analysis is required to comply with environmental regulations. Therefore, a typical wastewater treatment facility's operation relies on experienced people and their knowledge. The 4th Industrial Revolution, the next generation industrial revolution, consisting of the convergence of information and communication technology (ICT), has already been introduced to Korea and worldwide in various fields such as artificial intelligence (AI), the Internet of Things (IoT), and big data. However, in the domestic environmental technology field, the research and development of artificial intelligence technology is insufficient compared to other fields. Although various methods of applying artificial intelligence technology to wastewater treatment facilities are being proposed abroad, research and development in Korea have not yet been conducted on various water treatment systems. Also, there is a lack of standardized data-based model development cases and guidelines that can be optimally applied. Therefore, to apply artificial intelligence technology to wastewater treatment facilities, it is necessary to utilize the results derived from various studies and their comparison and analysis. Recently, the prediction models that can forecast the effluent water quality from wastewater treatment have been actively developed from AI-based machine learning and deep learning models and can be widely applied to wastewater treatment processes.
In this study, we compared and analyzed the performance of machine learning algorithms (SVM, RF, ANN) and deep learning algorithms (LSTM) using the daily flow rate and BOD concentration of effluent from the wastewater treatment process. In addition, we compared and analyzed the performance of a similar wastewater treatment process. The applicability of transfer learning was analyzed in two wastewater treatment processes.
This study assessed the performance of single and modified algorithms based on machine learning and deep learning for wastewater treatment process. More specifically, this study adopted support vector machine (SVM), random forest (RF), and artificial neural network (ANN) for machine learning as well as long short-term memory (LSTM) for deep learning. The performance of these (single) algorithms were compared with that of modified ones processed through hyperparameter tuning, ensemble learning (only for machine learning), and multi-layer stacking (i.e., two layers of LSTM units). The daily effluent of wastewater treatment process observed between 2017 and 2022 in the Cheong-Ju National Industrial Complex was used as input to all tested algorithms, which was evaluated with respect to mean squared error. For the model performance evaluation, discharge and biochemical oxygen demand are selected as dependent variables out of nine measured parameters. Results showed that the performance of any machine learning algorithms was superior to their competitor LSTM. This is mainly attributed to a small amount of input data provided to the LSTM algorithm and unstable effluent wastewater characteristics. Meanwhile, hyperparameter tuning improved the performance of all tested algorithms. However, ensemble learning for machine learning and two-layer stacking for LSTM generally resulted in performance degradation as compared to that of single algorithms, regardless of dependent variables. Therefore, this calls for a careful design and evaluation of modified algorithms, specifically for model architecture and performance improvement processes.
This study assessed the feasibility of transfer learning from one wastewater treatment process to another using two popular deep learning algorithms. Specifically, convolutional neural network (CNN) and long short-term memory (LSTM), which consisted of four and three hidden layers, respectively, were used as benchmark algorithms for transfer learning. Input data for both deep learning and transfer learning were provided from two wastewater treatment plants with identical treatment trains in series (located in Jinju and Cheongju City) over the five-year period from 2018 to 2022. Performance evaluation was also done not only against two deep learning algorithms but also against those adopting two transfer learning strategies, one for freezing all hidden layers developed from the pre-trained model and the other for training the last hidden layer only among multiple ones, with respect to Mean Squared Error (MSE). We found that the performance of both CNN and LSTM was relatively comparative regardless of dependent variables, discharge and biochemical oxygen demand (BOD), whereas the prediction accuracy of both algorithms was slightly higher for discharge than for BOD due to its low variability. When transfer learning which froze all hidden layers of the existing model was applied to two benchmark algorithms, the predictive performance of both algorithms was found to slightly improved only for discharge. Also, there was no measurable variation in the prediction accuracy of benchmark algorithms using the other transfer learning approach. Potential applications of transfer learning include the rapid reuse of the existing models (developed from source domains) for target domains which are hard to develop new prediction models due to the lack of data in deep learning.
The results derived from machine learning, deep learning, and transfer learning in wastewater treatment can be used to develop data-based model development standardization guidelines. This might be due to that the primary data could be used in future intelligent wastewater treatment algorithms and scenarios.