AI RESEARCH

Prosodic Boundary-Aware Streaming Generation for LLM-Based TTS with Streaming Text Input

arXiv CS.AI

ArXi:2603.06444v1 Announce Type: cross Streaming TTS that receives streaming text is essential for interactive systems, yet this scheme faces two major challenges: unnatural prosody due to missing lookahead and long-form collapse due to unbounded context. We propose a prosodic-boundary-aware post-