I built an open-source LLM eval framework as a BCA student — hallucination detection, red-teaming, regression tracking

Dev.to AI

## The Problem

Every company building AI products needs to know if their LLM is actually working, or getting worse over time. This is harder than it sounds. I built an open-source evaluation framework to solve this.