A/B Testing RAG Pipelines: Chunk Size, Retrieval, Embeddings, and Prompts

Towards AI
Generative AI Data Science

How to know if your changes actually work - paired t-test, Cohen’s d, and a reusable experiment framework built locally with Ollama