# NVIDIA: Nemotron 3 Ultra: AI model fact sheet

- **Provider:** nvidia
- **Released:** 2026-06-04
- **Context window:** 1,000,000 tokens
- **API pricing:** $0.000 / 1M input, $0.000 / 1M output
- **OpenRouter ID:** nvidia/nemotron-3-ultra-550b-a55b:free
- **Capabilities:** conversation, reasoning, code-generation, analysis, agentic-tool-use, tool-use, planning

NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA, with 55B active parameters out of 550B total (MoE). Built on a hybrid Transformer-Mamba mixture-of-experts architecture, it supports text input and output with a context window of up to 1M tokens. It is suited for long-running agentic workflows, including agent orchestration, coding agents, deep research, and complex multi-step reasoning.

Source: real side-by-side outputs, pricing and specs at https://rival.tips/models/nemotron-3-ultra