700KB embedding model that actually works, built a full family of static models from 0.7MB to 125MB
r/LocalLLaMA
•
Generative AI
Hey everyone, Yesterday I shared some static embedding models I'd been working on using model2vec + tokenlearn. Since then I've been grinding on improvements and ended up with something I think is pretty cool, a full family of models ranging from 125MB down to 700KB, all drop-in compatible with model2vec and sentence-transformers.