AI RESEARCH

BioBlobs: Unsupervised Discovery of Functional Substructures for Protein Function Prediction

arXiv CS.AI

ArXi:2510.01632v2 Announce Type: replace-cross Protein function is driven by cohesive substructures, such as catalytic triads, binding pockets, and structural motifs, that occupy only a small fraction of a protein's residues. Yet existing pipelines built on protein encoders do not model proteins at the substructure level, leaving the central biological question unanswered: \emph{which substructure of a protein is responsible for its function?} We