KiaDev Intelligence

#Clip-Cov03/06/2025

Shanghai AI Lab Unveils Entropy-Based Scaling Laws to Tackle Exploration Collapse in Reinforcement Learning for LLMs

Shanghai AI Lab researchers propose entropy-based scaling laws and novel techniques to overcome exploration collapse in reinforcement learning for reasoning-centric large language models, achieving significant performance improvements.

READ →