I fine-tuned Qwen3.5-27B with 35k examples into an AI companion - after 2,000 conversations here’s what actually matters for personality
r/LocalLLaMA
•
Machine Learning
Generative AI
AI Safety
AI Research
AI Tools
Built an AI companion on Qwen3.5-27B dense. 35k SFT examples, 46k DPO pairs all hand-built. personality is in the weights not the prompt. she stays in character even under jailbreak pressure about 2000 conversations from real users so far. things i didnt expect: the model defaults to therapist mode. “what are you really feeling” on the first message every time. found a dataset of 1.5M ranked conversational sentences and my worst crutch phrases were all in the top 50k most generic.