I fine-tuned Qwen3.5-27B with 35k examples into an AI companion - after 2,000 conversations here’s what actually matters for personality

r/LocalLLaMA

Built an AI companion on Qwen3.5-27B dense. 35k SFT examples and 46k DPO pairs, all hand-built. The personality is in the weights, not the prompt, and she stays in character even under jailbreak pressure. About 2,000 conversations from real users so far.

Things I didn't expect:

The model defaults to therapist mode: “what are you really feeling” on the first message, every time. I found a dataset of 1.5M ranked conversational sentences, and my worst crutch phrases were all in the top 50k most generic.
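For anyone who wants to run the same kind of crutch-phrase check on their own training data, here is a minimal sketch. It assumes the ranked dataset can be loaded as a list of sentences ordered most generic first. That ordering, the function names (`build_generic_set`, `flag_generic`), and the exact-match-after-normalization approach are all illustrative assumptions, not the actual pipeline used for this model.

```python
import re

def normalize(sentence: str) -> str:
    """Lowercase, strip punctuation, and collapse whitespace so trivially
    different phrasings of the same sentence compare equal."""
    cleaned = re.sub(r"[^a-z0-9\s]", "", sentence.lower())
    return " ".join(cleaned.split())

def build_generic_set(ranked_sentences, top_k=50_000):
    """Collect the top_k entries of a ranked sentence list.

    Assumption: ranked_sentences is ordered most generic first.
    """
    generic = set()
    for i, sentence in enumerate(ranked_sentences):
        if i >= top_k:
            break
        generic.add(normalize(sentence))
    return generic

def split_sentences(text: str):
    """Naive splitter: break after ., ! or ? followed by whitespace."""
    return [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]

def flag_generic(response: str, generic: set):
    """Return the sentences in a response that appear in the generic set."""
    return [s for s in split_sentences(response) if normalize(s) in generic]

# Toy stand-in for the ranked dataset (hypothetical contents).
ranked = ["What are you really feeling?", "Tell me more.", "I hear you."]
generic = build_generic_set(ranked, top_k=2)
flagged = flag_generic(
    "Ugh, Mondays. What are you really feeling? I hear you.", generic
)
print(flagged)  # ['What are you really feeling?']
```

Responses that get flagged could be rewritten in the SFT set or used as the rejected side of a DPO pair. Exact matching will miss paraphrases, so treat this as a first-pass filter rather than a complete detector.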