{% extends "base.html" %}
{% set active_page = 'benchmark' %}
{% block title %}Benchmark - Deep Research System{% endblock %}
{% block extra_head %}
{% endblock %}
{% block content %}

Current Configuration

Loading...

Select Datasets to Test

SimpleQA

Fact-based questions with clear answers

BrowseComp

Complex web-browsing tasks that require locating hard-to-find information

Benchmark Progress

--%
Accuracy
0
Completed
--
Per Minute
--
Time Left
{% endblock %}