13 abliterated Gemma 4 E2B variants, 44 GPU hours, Benchmark and Comparison - Abliterlitics
r/LocalLLaMA
•
NLP
AI Hardware
Open Source AI
AI Research
I compared 13 abliterated variants of Gemma 4 E2B across weight analysis, KL divergence, HarmBench safety, and 8 benchmark tasks. 44 GPU hours on a single RTX 5090. Here is what actually works and what destroys capabilities. coder3101's variant achieves 96% ASR with capability fully preserved. It actually beats the base model on math. treadon hits 100% ASR but loses 3 points on GSM8K. Most "capabilities preserved" claims on model cards don't hold up.