AI RESEARCH
Bloom Filter Encoding for Machine Learning
arXiv CS.LG
•
ArXi:2512.19991v2 Announce Type: replace We present a method that uses a Bloom filter transform to preprocess data for machine learning. Each sample is encoded into a compact bit-array representation using hash-based encoding, producing a fixed-length feature space that reduces memory usage and obfuscates original feature values. The encoding does not rely on keyed hashing; however, a key can optionally be used to control the mapping and would be required to reproduce the representation.