GENAIWIKI

Llama 3.2 1B Instruct

Legacy

Llama 3.2 1B Instruct is among the smallest Llama instruct checkpoints for extreme latency and footprint constraints.

Best for:Keyword and intent spottingCost tier:Open / entry

Open weights LLM · Release · Llama community license (see Meta)

edgetinyopen-weights

Updated 1 day ago · Verified Apr 2026 · Score 78

Decision summary

Why teams reach for it, where it fits, and what to watch for — before you dive into specs.

Why teams choose it

  • Often combined with larger models in cascades.
  • Monitor failure modes on out-of-domain prompts.

Best use cases

  • Use this when keyword and intent spotting
  • Use this when embedded firmware coprocessors (where licensed)

Tradeoffs

  • Very limited reasoning depth.
  • Higher hallucination risk without grounding.

Technical details

Modalities, benchmarks, and release context.

Modalities

What goes in and what comes out.

Inputs
text
Outputs
text
Capabilities
ultra-low latency, classification
Release: ·License: Llama community license (see Meta)

Benchmarks snapshot

Structured JSON for reproducible comparisons.

No benchmark data yet — see comparisons for relative performance.

Family lineup

Explore other versions in this family after you have the headline on this model.

Continue exploring

A short set of comparisons, nearby models, and links to go deeper — without repeating the same paths.

This page is based on publicly available documentation, benchmarks, and real-world usage patterns. Last reviewed for accuracy recently.