Devstral-Small-2-24B fine-tuned on Claude 4.6 Opus reasoning traces [GGUF Q4+Q5]
r/LocalLLaMA
I fine-tuned Devstral-Small-2-24B on 2,322 Claude 4.6 Opus reasoning traces to give it explicit chain-of-thought before writing code.

**Model:**

**Files available:**
- Q4_K_M GGUF (14.3GB)
- Q5_K_M GGUF (16.8GB) ← recommended
- LoRA adapter (370MB) for merging yourself

**Hardware used:** RTX 3090 24GB

**Framework:** Unsloth + QLoRA (r=16)

**Checkpoint:** End of epoch 2 (~1200 steps) - better generalisation than full epoch 3

The main challenge was that Devstral is a VLM (Pixtral vision encoder) which made direct text-only.
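
For anyone wanting to sanity-check the "~1200 steps at end of epoch 2" figure against the 2,322 traces, here is a rough sketch of the arithmetic. The post does not state the batch size, so the effective batch size of 4 below is purely an assumption chosen because it lands close to the quoted step count; `steps_per_epoch` is a hypothetical helper, not part of Unsloth.

```python
import math


def steps_per_epoch(num_examples: int, effective_batch_size: int) -> int:
    """Optimizer steps needed to see every example once (last partial batch kept)."""
    return math.ceil(num_examples / effective_batch_size)


NUM_TRACES = 2322            # from the post
ASSUMED_EFFECTIVE_BATCH = 4  # ASSUMPTION: not stated in the post

per_epoch = steps_per_epoch(NUM_TRACES, ASSUMED_EFFECTIVE_BATCH)

# Print the step index at which each epoch would end under this assumption.
for epoch in (1, 2, 3):
    print(f"end of epoch {epoch}: step {epoch * per_epoch}")
```

Under that assumed batch size, epoch 2 would end at step 1162, which is in the same ballpark as the ~1200 quoted above. A different batch size or gradient accumulation setting would shift these numbers.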