S
Sync
Dev
Home
About
Pricing
Blogs
Contact
Start a project
Home
About
Pricing
Blogs
Contact
Start a project
Home
/
Benchmarks
/
HumanEval
HumanEval
Hand-written programming problems
Model ranking
#
Model
Score (pass@1)
1
Claude 3.5 Sonnet
Anthropic
92.0
2
GPT-4o
OpenAI
90.2
3
Llama 3.1 405B
Meta AI
89.0
4
Gemini 1.5 Pro
Google DeepMind
84.1