FreedomIntelligence/HuatuoGPT-3-32B · Hugging Face
r/LocalLLaMA
•
Generative AI
Open Source AI
AI Tools
Reinforcement Learning
HuatuoGPT-3 is an open-source medical LLM trained with SeedRL, an RL-only domain adaptation paradigm that transforms a base model into a medical expert in a single RL stage. 8B is also available: submitted by /u/jacek2023 [link] [comments]