Llama 3.1 405B

Compare models

Meta's largest open-weights model, competitive with frontier closed models.

Key specifications

Intelligence
74.0
Context window
128K
Max output
4.1K
Output speed
30 t/s
Latency (TTFT)
0.70s
Input $/1M
$3.50
Output $/1M
$3.50
License
Llama 3.1 Community
Architecture
Transformer (dense)
Parameters
405B

Capabilities

Function callingStreamingJSON modeFine-tuningOpen weights

Strengths

  • + Open weights — self-hostable
  • + Strong general performance

Limitations

  • Heavy to run
  • Slower hosted throughput

Benchmark scores