Inception: Mercury

Inception: Mercury

Mercury is the first diffusion large language model (dLLM). Applying a breakthrough discrete diffusion approach, the model runs 5-10x faster than even speed optimized models like GPT-4.1 Nano and Claude 3.5 Haiku while matching their performance. Mercury's speed enables developers to provide responsive user experiences, including with voice agents, search interfaces, and chatbots.

ConversationReasoningCode GenerationAnalysis
Provider
Inception
Release Date
2025-06-26
Size
MEDIUM
Parameters
Not disclosed

Benchmark Performance

Performance metrics on industry standard AI benchmarks that measure capabilities across reasoning, knowledge, and specialized tasks.

Speed

5-10x faster than GPT-4.1 Nano
View Source

Performance

Matches GPT-4.1 Nano and Claude 3.5 Haiku
View Source

Model Insights

All Model Responses