AI RESEARCH
Exploring the Potential of Probabilistic Transformer for Time Series Modeling: A Report on the ST-PT Framework
arXiv CS.AI
•
ArXi:2604.26762v1 Announce Type: cross The Probabilistic Transformer (PT) establishes that the Transformer's self-attention plus its feed-forward block is mathematically equivalent to Mean-Field Variational Inference (MFVI) on a Conditional Random Field (CRF). Under this equivalence the Transformer ceases to be a black-box neural network and becomes a programmable factor graph: graph topology, factor potentials, and the message-passing schedule are all explicit and inspectable primitives that can be engineered.