Synthesizing Instruction-Tuning Datasets with Contrastive Decoding

arXiv:2604.13538v1 Announce Type: new Using responses generated by high-performing large language models (LLMs) for instruction tuning has become a widely adopted approach. However, the existing literature overlooks a property of LLM-generated responses: they conflate world knowledge acquired during pre-