Ayn: A Tiny yet Competitive Indian Legal Language Model Pretrained from Scratch

arXiv cs.AI

arXiv:2403.13681v3. Decoder-only Large Language Models (LLMs) are currently the model of choice for many Natural Language Processing (NLP) applications. Through instruction fine-tuning and prompting approaches, such LLMs have been efficiently used to solve both general and domain-specific tasks. However, they are costly to train and, to a certain extent, costly to use as well, which raises the question of whether LLMs can be replaced by domain-specific Tiny Language Models (TLMs), which typically contain fewer than 100M parameters.