FreedomIntelligence/HuatuoGPT-3-32B · Hugging Face

r/LocalLLaMA
Generative AI · Open Source AI · AI Tools · Reinforcement Learning

HuatuoGPT-3 is an open-source medical LLM trained with SeedRL, an RL-only domain adaptation paradigm that transforms a base model into a medical expert in a single RL stage. An 8B version is also available.

Submitted by /u/jacek2023