Dropping learning rate fixed my Qlora fine-tune more than anything else i tried
r/LocalLLaMA
•
Machine Learning
Generative AI
Open Source AI
AI Research
Been fine-tuning llama 3.1 8b with Qlora for a classification task using about 8k samples. I was getting bad eval results for a while and kept thinking something was wrong with my data. Tried cleaning the dataset, tried different prompt templates, messed with rank and alpha. Nothing realy changed. Dropped the learning rate from 2e-4 to 1e-4 and bumped epochs from 3 to 5. Ran it on a 5090 I rent on Hyperai since our lab machines are always booked. Completley different results. Same data, same everything else. 2e-4 is just too agressive when your dataset is that small.