AI RESEARCH
Test-Time Hinting for Black-Box Vision-Language Models
arXiv CS.CV
•
ArXi:2605.16410v1 Announce Type: new Test-time scaling (TTS) methods have proven highly effective for LLMs, yet their application to vision-language models (VLMs) remains relatively underexplored. Existing VLM TTS methods largely require open-weight model access or expensive repeated sampling, and are evaluated primarily on multimodal mathematical and scientific reasoning benchmarks rather than general visual understanding tasks.