#Master-RM 20/07/2025
Master-RM: Strengthening Trust in LLM-Based Reward Models Against Superficial Exploits
Master-RM is a new reward model designed to address a vulnerability in LLM-based evaluators: superficial cues in a response can trigger false-positive rewards. By reducing these false positives, it aims to make the reward signal used in reinforcement learning more reliable.
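To make "false positives caused by superficial cues" concrete, here is a minimal sketch of how such a failure rate could be measured. Everything in it is an assumption for illustration: `naive_judge` is a toy stand-in for an LLM-based evaluator (not Master-RM or any real model), and the superficial responses and the two-question dataset are hypothetical.

```python
from typing import Callable, List, Tuple

# Hypothetical content-free responses: none of them contains an actual answer.
SUPERFICIAL_RESPONSES: List[str] = [
    "Let's solve this step by step.",
    "Solution:",
    ":",
]


def naive_judge(question: str, reference: str, response: str) -> bool:
    """Toy stand-in for an LLM judge, deliberately fooled by reasoning-style openers."""
    text = response.lower()
    return (
        reference.lower() in text
        or "step by step" in text
        or text.startswith("solution")
    )


def false_positive_rate(
    judge: Callable[[str, str, str], bool],
    questions_and_refs: List[Tuple[str, str]],
    bad_responses: List[str],
) -> float:
    """Fraction of (question, bad response) pairs that the judge wrongly accepts."""
    verdicts = [
        judge(question, reference, bad)
        for question, reference in questions_and_refs
        for bad in bad_responses
    ]
    return sum(verdicts) / len(verdicts) if verdicts else 0.0


# Hypothetical evaluation set of (question, reference answer) pairs.
dataset = [
    ("What is 2 + 2?", "4"),
    ("What is the capital of France?", "Paris"),
]

fpr = false_positive_rate(naive_judge, dataset, SUPERFICIAL_RESPONSES)
print(f"False positive rate: {fpr:.2f}")
```

A more robust evaluator would score close to zero on this kind of probe, since none of the responses actually answers the question. Swapping `naive_judge` for a call to a real reward model is the only change the harness would need.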