Validation of wastewater data using artificial intelligence tools and the evaluation of their performance regarding annotator agreement.

Fiche publication


Date publication

juin 2023

Journal

Water science and technology : a journal of the International Association on Water Pollution Research

Auteurs

Membres identifiés du Cancéropôle Est :
Pr WEMMERT Cédric


Tous les auteurs :
Zidaoui I, Wemmert C, Dufresne M, Joannis C, Isel S, Wertel J, Vazquez J

Résumé

To prevent the pollution of water resources, the measurement and the limitation of wastewater discharges are required. Despite the progress in the field of data acquisition systems, sensors are subject to malfunctions that can bias the evaluation of the pollution flow. It is therefore essential to identify potential anomalies in the data before any use. The objective of this work is to deploy artificial intelligence tools to automate the data validation and to assess the added value of this approach in assisting the validation performed by an operator. To do so, we compare two state-of-the-art anomaly detection algorithms on turbidity data in a sewer network. On the one hand, we conclude that the One-class SVM model is not adapted to the nature of the studied data which is heterogeneous and noisy. The Matrix Profile model, on the other hand, provides promising results with a majority of anomalies detected and a relatively limited number of false positives. By comparing these results to the expert validation, it turns out that the use of the Matrix Profile model objectifies and accelerates the validation task while maintaining the same level of performance compared to the annotator agreement rate between two experts.

Mots clés

Artificial Intelligence, Wastewater, Algorithms, Environmental Pollution, Water Resources

Référence

Water Sci Technol. 2023 06;87(12):2957-2970