Llama (Meta)
Llama 4 Scout 10M ctx, Maverick 1M, MoE 17B active
Llama 4 Scout/Maverick (04/05/2025): open-weight MoE family (17B active) on 'Community License' (not OSI): >700M MAU clause, AUP, restrictions on using outputs to improve other LLMs. Scout offers context up to 10M tokens; Maverick — 1M. Natively multimodal (text+image). Meta refused to sign the EU GPAI Code of Practice — relevant for EU compliance.
Verified: 2026-05-22
Purchase decision (when to choose / when to avoid)
Choose if...
- You want open-weight and control (self-host) + wide tool ecosystem.
- You have high volume and want to optimize inference cost (quantizations, vLLM, etc.).
- You're building on-prem / edge solutions and have competence to maintain models.
Avoid if...
- You don't accept Community License / AUP restrictions — check details before deployment.
- You want the 'simplest possible' deployment without MLOps — SaaS API will be faster.
Cost in practice (scenarios)
Hosted at a provider is fastest; self-host requires preparation.
- small team
- quality testing
Self-host often wins on cost at steady volume, but maintenance is added.
- GPU, monitoring, guardrails
Deployment / data / enterprise
Deployment channels
- Self-host (vLLM/llama.cpp/Ollama)
- Hosting providers (e.g. Bedrock / Together / Groq — depending on offering)
- Integrations in custom applications
Data policy
- Training on data
- Self-host: on your side.
- Retention
- Self-host: on your side; hosted: depends on provider.
- Data residency
- Depends on hosting location.
Enterprise readiness
- Admin
- Self-host: on your side; hosted: depends on provider.
- SSO/SCIM
- Depends on the platform you deploy on.
- Audit
- Depends on platform.
- DPA
- Depends on provider/agreement.
- Certifications
- Depends on provider/agreement.