AI RESEARCH

How Private Are DNA Embeddings? Inverting Foundation Model Representations of Genomic Sequences

arXiv CS.LG

ArXi:2603.06950v1 Announce Type: cross DNA foundation models have become transformative tools in bioinformatics and healthcare applications. Trained on vast genomic datasets, these models can be used to generate sequence embeddings, dense vector representations that capture complex genomic information. These embeddings are increasingly being shared via Embeddings-as-a-Service (EaaS) frameworks to facilitate downstream tasks, while supposedly protecting the privacy of the underlying raw sequences.