Llama 4 Maverick

Trending

Meta · Active · Updated May 18, 2026

Meta's most capable open-weight model delivering frontier performance with full model customization and community innovation.

Input Price
$1.50/M
per million tokens
Output Price
$4.50/M
per million tokens
Context Window
262,144
tokens
Max Output
16,384
tokens

Technical Specifications

ProviderMeta
Release DateApril 1, 2026
Pricing Typeper token
Input Price$1.5.00 / 1M tokens
Output Price$4.5.00 / 1M tokens
Cached Input
Context Window262,144 tokens
Max Output16,384 tokens
Input Modalitiestext, image
Output Modalitiestext
Statusactive
Availabilityapi, enterprise
Latencymedium
Rate Limit1,500 RPM
Pricing URLView official pricing →
Docs URLView documentation →

Capability Scores

Coding
86
Reasoning
84
Math
82
Image
72
Speed
70

Overview

Llama 4 Maverick represents Meta's most advanced open-weight model, delivering performance that rivals proprietary frontier models like GPT-5.4 and Claude 4 Sonnet. With a 256K context window, native image input, and strong coding and reasoning capabilities, Maverick is fully open for self-hosting, fine-tuning, and customization. It has quickly become the foundation of the open-source AI ecosystem.

Pros

  • +Fully open-weight — free to use, modify, and self-host
  • +Competitive with proprietary frontier models (coding: 86/100)
  • +256K context window with native image input
  • +Extensive community ecosystem with fine-tuned variants

Cons

  • Requires significant hardware for self-hosting
  • Slower inference than smaller optimized models
  • Smaller ecosystem of tools compared to OpenAI/Anthropic

Compare with Alternatives

Use Cases

Self-hosted AI deployments for data privacy
Fine-tuning and custom model development
Research and academic applications

Frequently Asked Questions about Llama 4 Maverick

How much does Llama 4 Maverick cost?
Llama 4 Maverick costs $1.5 per million input tokens and $4.5 per million output tokens.
What is the context window of Llama 4 Maverick?
Llama 4 Maverick has a 262,144 token context window, with a maximum output of 16,384 tokens.
Is Llama 4 Maverick good for coding?
Llama 4 Maverick scores 86/100 on coding benchmarks.
What modalities does Llama 4 Maverick support?
Llama 4 Maverick supports text, image input and text output.