FILTER MODE ACTIVE

#actor-critic

Records found: 2

#actor-critic05/11/2025

Train a Model-Native Agent to Internalize Planning, Memory and Tool Use with End-to-End RL

'A compact neural agent learns to plan, store and compose symbolic tools end-to-end with reinforcement learning, demonstrating emergent multi-step reasoning on synthetic arithmetic tasks.'

READ →

#actor-critic30/06/2025

DSRL: Steering Robot Policies via Latent-Space Reinforcement Learning for Real-World Adaptation

DSRL introduces a novel method to adapt diffusion-based robotic policies via latent-space reinforcement learning, significantly boosting real-world task performance without modifying base models.

READ →