AI RESEARCH

ZAYA1-8B Technical Report

arXiv CS.CL

ArXi:2605.05365v1 Announce Type: cross We present ZAYA1-8B, a reasoning-focused mixture-of-experts (MoE) model with 700M active and 8B total parameters, built on Zyphra's MoE++ architecture. ZAYA1-8B's core pre