Meta
Llama 3.1 405B Instruct
Open weights LLM · Release Jul 23, 2024 · Llama 3.1 Community License
A large open-weights instruction-tuned model, competitive on reasoning and coding benchmarks, with a permissive license that allows fine-tuning and customization.
Modalities
What goes in and what comes out.
Inputs
text
Outputs
text
Capabilities
reasoning, coding, fine-tuning
Benchmarks snapshot
Structured JSON for reproducible comparisons.
{
"gpqa": 51.1,
"mmlu": 88.6
}
Related on GenAIWiki
Same provider, tooling that cites the model, or prompts tuned for it.
Meta
Llama 3 70B
Llama 3 70B features 70 billion parameters and a context window of 8k tokens, optimized for high-performance text generation and understanding across diverse tasks.
Meta
Llama 3 8B
Llama 3 8B is a compact model with 8 billion parameters, designed for efficient text generation and understanding with a context window of 8k tokens.
Inference
Groq
GroqCloud offers very low-latency, high-throughput LLM inference using Groq’s LPU-style hardware, with OpenAI-compatible APIs for select open and partner models aimed at interactive and batch production workloads.
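Since such endpoints follow the OpenAI chat-completions shape, calling an open-weights model through one reduces to building a standard request payload. A minimal sketch below — the base URL and model identifier are illustrative assumptions, not values confirmed by this page; check the provider's documentation for the exact strings.

```python
import json

# Assumed OpenAI-compatible base URL and a hypothetical model identifier;
# verify both against the provider's docs before use.
BASE_URL = "https://api.groq.com/openai/v1"
MODEL = "llama-3.1-405b-instruct"

def build_chat_request(prompt: str, temperature: float = 0.2) -> dict:
    """Build an OpenAI-style chat-completions payload for the model."""
    return {
        "model": MODEL,
        "messages": [
            {"role": "system", "content": "You are a helpful assistant."},
            {"role": "user", "content": prompt},
        ],
        "temperature": temperature,
    }

# The payload would be POSTed to f"{BASE_URL}/chat/completions"
# with an Authorization: Bearer <api-key> header.
payload = build_chat_request("Summarize what an instruct model is in one sentence.")
print(json.dumps(payload, indent=2))
```

The same payload works unchanged against any OpenAI-compatible server, which is why frameworks listed below (LangChain, Hugging Face endpoints) can swap providers behind a common interface.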
ML platform
Hugging Face
Hub for open models, datasets, and Spaces demos, plus Inference Endpoints, Transformers, and enterprise features for teams that train, fine-tune, or serve open-weight and partner models at scale.
Orchestration
LangChain
Application framework for orchestrating LLM workflows, tool calling, retrieval, and agents across multiple providers in Python and TypeScript ecosystems.
Compute
Modal
Serverless compute platform for AI inference and batch workloads, offering GPU execution, scalable workers, and code-first deployment patterns for model-powered applications.