TY - GEN
T1 - Anomaly Detection under Contaminated Data
T2 - 12th IEEE International Conference on Data Science and Advanced Analytics, DSAA 2025
AU - Donne, Stefano
AU - Davis, Jesse
AU - Van Utterbeeck, Filip
AU - Verbeke, Mathias
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - Reliable anomaly detection under data contamination remains a major challenge in Prognostics and Health Management, especially when degradation processes are gradual and clean training data are unavailable. This paper introduces a weighted iterative refinement framework with autoencoders for contaminated anomaly detection (WIRACAD) to address this problem. The method, which is based on reconstruction residuals, re-weights training samples across iterations in order to progressively reduce the influence of suspected anomalies. This continuous refinement improves the robustness of health indicator learning from contaminated time series. The proposed approach is evaluated on two public benchmarks: The NASA C-MAPSS dataset and the IMS Bearing dataset. Results show consistent improvements in key metrics related to degradation monitoring. In particular, the overal fit score and the monotonicty are improved when compared to baseline autoencoder training and recent refinement-based methods. These findings suggest that iterative sample weighting can enhance unsupervised anomaly detection with autoencoders in settings where data contamination is assumed.
AB - Reliable anomaly detection under data contamination remains a major challenge in Prognostics and Health Management, especially when degradation processes are gradual and clean training data are unavailable. This paper introduces a weighted iterative refinement framework with autoencoders for contaminated anomaly detection (WIRACAD) to address this problem. The method, which is based on reconstruction residuals, re-weights training samples across iterations in order to progressively reduce the influence of suspected anomalies. This continuous refinement improves the robustness of health indicator learning from contaminated time series. The proposed approach is evaluated on two public benchmarks: The NASA C-MAPSS dataset and the IMS Bearing dataset. Results show consistent improvements in key metrics related to degradation monitoring. In particular, the overal fit score and the monotonicty are improved when compared to baseline autoencoder training and recent refinement-based methods. These findings suggest that iterative sample weighting can enhance unsupervised anomaly detection with autoencoders in settings where data contamination is assumed.
KW - Anomaly Detection
KW - Autoencoders
KW - Data Contamination
KW - Degradation Monitoring
KW - Iterative Refinement
KW - Prognostic and Health Management
UR - https://www.scopus.com/pages/publications/105029909937
U2 - 10.1109/DSAA65442.2025.11248011
DO - 10.1109/DSAA65442.2025.11248011
M3 - Conference contribution
AN - SCOPUS:105029909937
T3 - 2025 IEEE 12th International Conference on Data Science and Advanced Analytics, DSAA 2025
BT - 2025 IEEE 12th International Conference on Data Science and Advanced Analytics, DSAA 2025
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 9 October 2025 through 12 October 2025
ER -