S
Sync
Dev
Home
About
Pricing
Blogs
Contact
Start a project
Home
About
Pricing
Blogs
Contact
Start a project
Home
/
Benchmarks
/
GPQA Diamond
GPQA Diamond
Graduate-Level Google-Proof Q&A
Model ranking
#
Model
Score (%)
1
Claude 3.5 Sonnet
Anthropic
65.0%
2
Gemini 1.5 Pro
Google DeepMind
59.1%
3
GPT-4o
OpenAI
53.6%
4
Llama 3.1 405B
Meta AI
51.1%