Monday, 22 June 2026 | Updating Daily AI insight, written for builders

NVIDIA Nemotron 3 Nano Omni

NVIDIA Nemotron 3 Nano Omni — Specifications

DesenvolvedorNVIDIA
TipoMultimodal (omni)
ModalidadeText, Image, Audio, Video → Text
Parâmetros30B total / ~3B active (MoE)
Janela de contexto256 mil
Saída máxima
LicençaNVIDIA Open Model Agreement
Pesos abertos✅ Yes
Lançamento2026
Input price
Output price
API providersHugging Face, OpenRouter, NVIDIA NIM

🖥️ Run it locally

VRAM (FP16/BF16)~62 GB
VRAM (4-bit)~21 GB (NVFP4)
Minimum GPURTX 5090 32GB (NVFP4) / H100 80GB (BF16)

📊 Benchmarks

OCRBench V267.04
Video-MME72.2
OSWorld47.4
Speech IF89.39

Official page →

NVIDIA’s open omni-modal model — it sees, hears, watches and reads (text, image, audio, video → text) in a single 30B-A3B mixture-of-experts that activates only ~3B parameters per token. A Mamba-Transformer hybrid that runs on one high-end GPU; open weights under the NVIDIA Open Model Agreement (commercial use allowed).

Scroll to Top