#MBPP09/10/2025
RA3: Temporal Action Abstractions to Speed Up RL Post-Training in Code LLMs
'RA3 formalizes mid-training as pruning plus horizon shortening and uses temporal action abstractions to accelerate RL post-training, boosting code generation benchmarks.'