Llama 4 Scout
Llama 4 Scout is Meta's compact yet powerful multimodal model with 17B active parameters and 16 experts (109B total parameters). It fits on a single H100 GPU with Int4 quantization and offers an industry-leading 10M token context window, outperforming Gemma 3, Gemini 2.0 Flash-Lite, and Mistral 3.1 across various benchmarks.
ConversationReasoningCode GenerationAnalysis
Provider
Meta
Release Date
2025-04-05
Size
MEDIUM
Parameters
17B active (109B total)
Benchmark Performance
Performance metrics on industry standard AI benchmarks that measure capabilities across reasoning, knowledge, and specialized tasks.
Context Length
10M tokens