AI RESEARCH
Thinking Deeper, Not Longer: Depth-Recurrent Transformers for Compositional Generalization
arXiv CS.AI
•
ArXi:2603.21676v1 Announce Type: cross Standard Transformers have a fixed computational depth, fundamentally limiting their ability to generalize to tasks requiring variable-depth reasoning, such as multi-hop graph traversal or nested logic. We propose a depth-recurrent Transformer that decouples computational depth from parameter count by iteratively applying a shared-weight Transformer block in latent space -- enabling the model to trade recurrence steps for deeper reasoning at inference time.