AI RESEARCH
Cross-Granularity Representations for Biological Sequences: Insights from ESM and BiGCARP
arXiv CS.LG
•
ArXi:2603.20825v1 Announce Type: new Recent advances in general-purpose foundation models have stimulated the development of large biological sequence models. While natural language shows symbolic granularity (characters, words, sentences), biological sequences exhibit hierarchical granularity whose levels (nucleotides, amino acids, protein domains, genes) further encode biologically functional information.