13 abliterated Gemma 4 E2B variants, 44 GPU hours, Benchmark and Comparison - Abliterlitics

r/LocalLLaMA
NLP AI Hardware Open Source AI AI Research

I compared 13 abliterated variants of Gemma 4 E2B across weight analysis, KL divergence, HarmBench safety, and 8 benchmark tasks. 44 GPU hours on a single RTX 5090. Here is what actually works and what destroys capabilities. coder3101's variant achieves 96% ASR with capability fully preserved. It actually beats the base model on math. treadon hits 100% ASR but loses 3 points on GSM8K. Most "capabilities preserved" claims on model cards don't hold up.