[P] ColQwen3.5-v1 4.5B SOTA on ViDoRe V1 (nDCG@5 0.917)
r/MachineLearning
Sharing a model I've been working on: ColQwen3.5-v1, a 4.5B-parameter model built on Qwen3.5-4B using the ColPali late-interaction approach. It's currently SOTA on ViDoRe V1 (nDCG@5 0.917) and competitive on ViDoRe V3. It was trained across 4 phases, including hard negative mining and domain specialization on finance/table docs. Apache 2.0, weights on HF: … and a PR has been raised to merge in … Working on v2 to simplify the …
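For anyone unfamiliar with late interaction, here is a minimal sketch of the MaxSim scoring that ColPali-style retrievers are generally described as using: each query token embedding is matched to its best page patch embedding, and those best matches are summed. This is plain Python on toy vectors, not the model's actual code; `maxsim_score` and `rank_pages` are hypothetical names made up for illustration, and real embeddings would come from the model rather than being written by hand.

```python
from typing import Dict, List, Sequence

Vector = Sequence[float]


def dot(a: Vector, b: Vector) -> float:
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def maxsim_score(query_vecs: Sequence[Vector], page_vecs: Sequence[Vector]) -> float:
    """Late-interaction (MaxSim) score: for every query token vector,
    take its highest similarity against any page patch vector, then sum."""
    return sum(max(dot(q, p) for p in page_vecs) for q in query_vecs)


def rank_pages(query_vecs: Sequence[Vector],
               pages: Dict[str, Sequence[Vector]]) -> List[str]:
    """Return page names sorted from best to worst MaxSim score."""
    scores = {name: maxsim_score(query_vecs, vecs) for name, vecs in pages.items()}
    return sorted(scores, key=lambda name: scores[name], reverse=True)
```

With a two-token query `[[1.0, 0.0], [0.0, 1.0]]`, a page whose patches cover both directions scores higher than a page whose patches all point the same way, because every query token finds a good match on the first page.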