Tested 14 embedding models on Thai — here's how they rank
r/LocalLLaMA
•
Generative AI
Ran MTEB benchmarks on 15 Thai tasks using A100 GPUs. Results: Qwen3-Embedding-4B - 74.41 KaLM-Gemma3-12B - 73.92 BOOM_4B_v1 - 71.84 jina-v5-text-small - 71.69 Qwen3-Embedding-0.6B - 69.08 multilingual-e5-large - 67.22 jina-v5-text-nano - 66.85 bge-m3 - 64.77 jina-v3 - 57.81 Qwen3-0.6B is impressive for its size - nearly matches 4B models on Thai. bge-m3 is solid but nothing special for Thai specifically. Interactive leaderboard with per-task breakdown: All benchmarks ran on Thailand's national supercomputer (LANTA). Results merged into the official MTEB repo.