R-Zero: New AI Framework Enables LLMs to Self-Evolve, Boosting Reasoning Skills Without Human Input

R-Zero: Self-Evolving Reasoning LLM from Zero Data

View PDF HTML (experimental) Abstract:Self-evolving Large Language Models (LLMs) offer a scalable path toward super-intelligence by autonomously generating, refining, and learning from their own experiences. However, existing methods for training such models still rely heavily on vast human-curated tasks and labels, typically via fine-tuning or reinforcement learning, which poses a fundamental bottleneck to advancing AI systems toward capabilities beyond human intelligence. To overcome this limi...