#VL-Cogito09/08/2025
VL-Cogito: Curriculum RL and Adaptive Length Rewards Transform Multimodal Reasoning
'VL-Cogito uses a staged curriculum RL and adaptive length rewards to significantly boost multimodal reasoning on math, science and chart benchmarks, outperforming several prior MLLMs.'