Evaluating Voice Agents in 2025: Beyond WER to Task Success, Barge-In, and Noise-Driven Hallucinations
A practical framework for evaluating modern voice agents that extends beyond ASR and WER to include task success, barge-in handling, hallucination-under-noise, safety, and perceptual quality.