Fast facts: Research agent performance metrics

Test-Time Diffusion Deep Researcher (TTD-DR) significantly outperforms existing deep research agents.

TTD-DR achieves 69.1% win rate in LongForm Research tasks compared to OpenAI Deep Research.

Helpfulness and Comprehensiveness are key metrics for evaluating research outputs.

Self-evolution improves individual components of the research agent's workflow.

The final report from TTD-DR is produced after a continuous feedback loop of revisions.