Gemini 2.5’s top coding benchmark?

 title: 'Objects arranged in different layouts for SVG reconstruction prompt.'

Gemini 2.5 Pro excels at coding tasks and represents a marked improvement over previous models[1]. Performance on LiveCodeBench increased from 30.5% for Gemini 1.5 Pro to 69.0% for Gemini 2.5 Pro, while that for Aider Polyglot went from 16.9% to 82.2%[1].

Relative to other large language models, Gemini achieves the state-of-the-art (SoTA) score on the Aider Polyglot coding task[1]. Gemini also achieves the highest score on Humanity’s Last Exam, GPQA (diamond), and on the SimpleQA and FACTS Grounding factuality benchmarks out of all of the models examined[1].