AI RESEARCH
Distilling Genomic Models for Efficient mRNA Representation Learning via Embedding Matching
arXiv CS.AI
•
ArXi:2604.08574v1 Announce Type: cross Large Genomic Foundation Models have recently achieved remarkable results and in-vivo translation capabilities. However these models quickly grow to over a few Billion of parameters and are expensive to run when compute is limited. To overcome this challenge, we present a distillation framework for transferring mRNA representations from a state of the art genomic foundation model into a much smaller model specialized for mRNA sequences, reducing the size by 200-fold.