NVIDIA develops the Nemotron family and provides leading AI hardware and software stacks (CUDA, RTX, TensorRT) for accelerated inference.
NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, designed as a unified model for reasoning and non-reasoning tasks. It can expose an internal reasoning trace and then produce a final answer, or be configured via system prompt to only provide final answers without intermediate traces.