AI RESEARCH
Ortho-Hydra: Orthogonalized Experts for DiT LoRA
arXiv CS.LG
•
ArXi:2605.03252v1 Announce Type: new LoRA fine-tuning of diffusion transformers (DiT) on multi-style data suffers from \emph{style bleed}: a single low-rank residual cannot represent several distinct artist fingerprints, and the optimizer converges to their average. Mixture-of-experts LoRA in the HydraLoRA style replaces the up-projection with $E$ heads under a router, but when every expert is zero-initialized the router receives identical gradient from each head and remains at the uniform prior.