AI RESEARCH
ProtSent: Protein Sentence Transformers
arXiv CS.LG
•
ArXi:2605.06830v1 Announce Type: new Protein language models (pLMs) produce per-residue representations that capture evolutionary and structural information, yet their mean-pooled sequence embeddings are not explicitly trained to reflect functional, evolutionary or structural similarity between proteins. We present Protein Sentence Transformers (ProtSent), a contrastive fine-tuning framework for adapting PLMs into general-purpose embedding models.