Quiz on evaluation benchmarks for AI browser agents