AI RESEARCH
CASCADE: Context-Aware Relaxation for Speculative Image Decoding
arXiv CS.AI
•
ArXi:2605.07230v1 Announce Type: cross Autoregressive generation is a powerful approach for high-fidelity image synthesis, but it remains computationally demanding and slow even on the most advanced accelerators. While speculative decoding has been explored to mitigate this bottleneck, existing approaches fail to achieve efficiency gains comparable to those observed in text generation. A key limitation is the target model's high uncertainty during image generation, which leads to high draft token rejection rates.