Enhancing Linguistic Competence of Language Models through Pre-training with Language Learning Tasks

ArXi:2601.03448v2 Announce Type: replace Language models (LMs) are pre-trained on raw text datasets to generate text sequences token-by-token. While this approach facilitates the learning of world knowledge and reasoning, it does not explicitly optimize for linguistic competence. To bridge this gap, we propose L2T, a pre-