Answer evidence browser
Browse every answer behind the judged comparison.
Search the full published benchmark set, scan per-question scores, then open any row to compare MHF and baseline answer bodies side by side.
- Questions
- --
- Judge
- --
- Best MRB
- --
Loading cases...
Question score table
Click any row to inspect the answers
Rows follow the current search and topic filters.
| Question | Source | MHF full | Raw | Delta | Spread |
|---|---|---|---|---|---|
| Loading scores... | |||||
Loading comparison...