Spoiler Alert: Narrative Forecasting as a Metric for Tension in LLM Storytelling

ArXi:2604.09854v1 Announce Type: new LLMs have so far failed both to generate consistently compelling stories and to recognize this failure--on the leading creative-writing benchmark (EQ-Bench), LLM judges rank zero-shot AI stories above New Yorker short stories, a gold standard for literary fiction. We argue that existing rubrics overlook a key dimension of compelling human stories: narrative tension. We