Real-time outlier detection for large datasets by RT-DetMCD

Bart De Ketelaere, Mia Hubert, Jakob Raymaekers, Peter J. Rousseeuw*, Iwein Vranckx

*Corresponding author for this work

Research output: Contribution to journal › Article › Academic › peer-review

Abstract

Modern industrial machines can generate gigabytes of data in seconds, frequently pushing the boundaries of available computing power. Together with the time criticality of industrial processing, this presents a challenging problem for any data analytics procedure. We focus on the deterministic minimum covariance determinant method (DetMCD), which detects outliers by fitting a robust covariance matrix. We construct a much faster version of DetMCD by replacing its initial estimators with two new methods and incorporating update-based concentration steps. The computation time is reduced further by parallel computing, with a novel robust aggregation method to combine the results from the threads. The speed and accuracy of the proposed real-time DetMCD method (RT-DetMCD) are illustrated by simulation and a real industrial application to food sorting.
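
The concentration step mentioned in the abstract can be illustrated with a minimal sketch. This is not the RT-DetMCD procedure: it uses the plain (non-update-based) C-step, handles bivariate data only, and starts from a hypothetical initial subset (the h points nearest the coordinatewise median) rather than the initial estimators of DetMCD or the paper. All function names, the simulated data and the cutoff are assumptions made for illustration.

```python
import random
import statistics

def mean_cov(points):
    """Sample mean and covariance (sxx, sxy, syy) of bivariate points."""
    n = len(points)
    mx = sum(x for x, _ in points) / n
    my = sum(y for _, y in points) / n
    sxx = sum((x - mx) ** 2 for x, _ in points) / (n - 1)
    syy = sum((y - my) ** 2 for _, y in points) / (n - 1)
    sxy = sum((x - mx) * (y - my) for x, y in points) / (n - 1)
    return (mx, my), (sxx, sxy, syy)

def sq_dist(point, center, cov):
    """Squared Mahalanobis distance, using the closed-form 2x2 inverse."""
    sxx, sxy, syy = cov
    det = sxx * syy - sxy * sxy
    dx, dy = point[0] - center[0], point[1] - center[1]
    return (syy * dx * dx - 2.0 * sxy * dx * dy + sxx * dy * dy) / det

def initial_subset(points, h):
    """Hypothetical deterministic start: h points nearest the coordinatewise median."""
    mx = statistics.median(x for x, _ in points)
    my = statistics.median(y for _, y in points)
    order = sorted(range(len(points)),
                   key=lambda i: (points[i][0] - mx) ** 2 + (points[i][1] - my) ** 2)
    return sorted(order[:h])

def c_step(points, subset, h):
    """One concentration step: refit on the subset, keep the h smallest distances."""
    center, cov = mean_cov([points[i] for i in subset])
    order = sorted(range(len(points)),
                   key=lambda i: sq_dist(points[i], center, cov))
    return sorted(order[:h])

def concentrate(points, subset, h, max_iter=100):
    """Iterate C-steps until the subset no longer changes."""
    for _ in range(max_iter):
        new_subset = c_step(points, subset, h)
        if new_subset == subset:
            break
        subset = new_subset
    center, cov = mean_cov([points[i] for i in subset])
    return subset, center, cov

# Simulated data (an assumption): 100 clean points plus 10 clustered outliers.
random.seed(0)
clean = [(random.gauss(0, 1), random.gauss(0, 1)) for _ in range(100)]
outliers = [(10 + random.gauss(0, 0.5), 10 + random.gauss(0, 0.5)) for _ in range(10)]
data = clean + outliers

h = 75
subset, center, cov = concentrate(data, initial_subset(data, h), h)

# 0.99 quantile of the chi-squared distribution with 2 degrees of freedom.
CUTOFF = 9.21
flagged = [i for i, p in enumerate(data) if sq_dist(p, center, cov) > CUTOFF]
```

The raw scatter estimate here is not rescaled, so some clean points may be flagged along with the planted outliers; practical MCD implementations usually apply a consistency factor and a reweighting step, both omitted from this sketch.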
Original language: English
Pages (from-to): 103957
Journal: Chemometrics and Intelligent Laboratory Systems
Volume: 199
Publication status: Published - 1 Apr 2020
Externally published: Yes
