#Polaris-4B 27/06/2025
Polaris-4B and Polaris-7B: Scalable Reinforcement Learning Unlocks Advanced Math and Logic Reasoning
Polaris-4B and Polaris-7B are trained with a novel reinforcement learning recipe that scales reasoning capabilities efficiently, achieving state-of-the-art results on math benchmarks with smaller models.