Comparison of Outlier Detection Algorithms on String Data

arXiv cs.LG

arXiv:2603.11049v1. Outlier detection is a well-researched and crucial problem in machine learning. However, most of the literature focuses on numerical data, and there is little research on outlier detection for string data. A robust outlier detection algorithm for string data could assist with data cleaning or with anomaly detection in system log files. In this thesis, we compare two string outlier detection algorithms. Firstly, we