AI RESEARCH

ENSEMBITS: an alphabet of protein conformational ensembles

arXiv CS.AI

ArXi:2605.13789v1 Announce Type: cross Protein structure tokenizers (PSTs) are workhorses in protein language modeling, function prediction, and evolutionary analysis. However, existing PSTs only capture local geometry of static structures, and miss the correlated motions and alternative conformational states revealed by protein ensembles. Here we