Llama 4 Maverick
TrendingMeta · Active · Updated May 18, 2026
Meta's most capable open-weight model delivering frontier performance with full model customization and community innovation.
Input Price
$1.50/M
per million tokens
Output Price
$4.50/M
per million tokens
Context Window
262,144
tokens
Max Output
16,384
tokens
Technical Specifications
| Provider | Meta |
| Release Date | April 1, 2026 |
| Pricing Type | per token |
| Input Price | $1.5.00 / 1M tokens |
| Output Price | $4.5.00 / 1M tokens |
| Cached Input | — |
| Context Window | 262,144 tokens |
| Max Output | 16,384 tokens |
| Input Modalities | text, image |
| Output Modalities | text |
| Status | active |
| Availability | api, enterprise |
| Latency | medium |
| Rate Limit | 1,500 RPM |
| Pricing URL | View official pricing → |
| Docs URL | View documentation → |
Capability Scores
Coding86
Reasoning84
Math82
Image72
Speed70
Overview
Llama 4 Maverick represents Meta's most advanced open-weight model, delivering performance that rivals proprietary frontier models like GPT-5.4 and Claude 4 Sonnet. With a 256K context window, native image input, and strong coding and reasoning capabilities, Maverick is fully open for self-hosting, fine-tuning, and customization. It has quickly become the foundation of the open-source AI ecosystem.
Pros
- +Fully open-weight — free to use, modify, and self-host
- +Competitive with proprietary frontier models (coding: 86/100)
- +256K context window with native image input
- +Extensive community ecosystem with fine-tuned variants
Cons
- −Requires significant hardware for self-hosting
- −Slower inference than smaller optimized models
- −Smaller ecosystem of tools compared to OpenAI/Anthropic
Compare with Alternatives
Use Cases
Self-hosted AI deployments for data privacy
Fine-tuning and custom model development
Research and academic applications