I built an open-source LLM eval framework as a BCA student — hallucination detection, red-teaming, regression tracking
Dev.to AI
## The Problem

Every company building AI products needs to know whether its LLM is actually working, or getting worse over time. This is harder than it sounds. I built an open-source evaluation framework to solve this.