KiaDev Intelligence

#Polaris-4B27/06/2025

Polaris-4B and Polaris-7B: Scalable Reinforcement Learning Unlocks Advanced Math and Logic Reasoning

Polaris-4B and Polaris-7B introduce a novel reinforcement learning recipe that scales reasoning capabilities efficiently, achieving state-of-the-art results on math benchmarks with smaller models.

READ →