#FutureSearch02/06/2025
Evaluating AI Agents: Insights from the Deep Research Bench Report
The Deep Research Bench report by FutureSearch evaluates AI agents on complex research tasks, revealing strengths and key limitations of leading models like OpenAI's o3 and Google Gemini.