Tiny Inference-Time Scaling with Latent Verifiers

ArXi:2603.22492v1 Announce Type: cross Inference-time scaling has emerged as an effective way to improve generative models at test time by using a verifier to score and select candidate outputs. A common choice is to employ Multimodal Large Language Models (MLLMs) as verifiers, which can improve performance but