AI RESEARCH

DS$^2$-Instruct: Domain-Specific Data Synthesis for Large Language Models Instruction Tuning

arXiv CS.CL

ArXi:2603.12932v1 Announce Type: new Adapting Large Language Models (LLMs) to specialized domains requires high-quality instruction tuning datasets, which are expensive to create through human annotation. Existing data synthesis methods focus on general-purpose tasks and fail to capture domain-specific terminology and reasoning patterns. To address this, we