#VeRL25/11/2025
Agent0: Self-Evolving LLMs That Learn Tools and Solve Math Without External Data
'Agent0 co-evolves a curriculum agent and an executor from the same base LLM, using sandboxed Python tool calls and ambiguity-aware RL to improve math and general reasoning without external data.'