Monday, 22 June 2026 | Updating Daily AI insight, written for builders

NVIDIA Nemotron 3 Nano Omni

NVIDIA Nemotron 3 Nano Omni — Specifications

Developer: NVIDIA
Type: Multimodal (omni)
Modality: Text, Image, Audio, Video → Text
Parameters: 30B total / ~3B active (MoE)
Context window: 256K tokens
Max output: not listed
License: NVIDIA Open Model Agreement
Open weights: ✅ Yes
Released: 2026
Input price: not listed
Output price: not listed
API providers: Hugging Face, OpenRouter, NVIDIA NIM

🖥️ Run it locally

VRAM (FP16/BF16): ~62 GB
VRAM (4-bit): ~21 GB (NVFP4)
Minimum GPU: RTX 5090 32 GB (NVFP4) / H100 80 GB (BF16)
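As a rough sanity check on the VRAM figures, here is a minimal sketch of the usual weights-only estimate (parameter count times bytes per parameter). The helper name `weight_memory_gb` and the bytes-per-parameter table are illustrative assumptions, not taken from NVIDIA's documentation, and the estimate deliberately ignores activations, caches, and runtime overhead.

```python
# Weights-only memory estimate: parameters x bytes per parameter.
# Assumption: ignores activations, KV/state caches, quantization scale
# factors, and framework overhead, so real usage will be higher.

BYTES_PER_PARAM = {
    "bf16": 2.0,  # 16 bits per weight
    "fp16": 2.0,  # 16 bits per weight
    "4bit": 0.5,  # 4 bits per weight, ignoring per-block scales
}


def weight_memory_gb(num_params: float, precision: str) -> float:
    """Return the approximate weight footprint in GB (1 GB = 1e9 bytes)."""
    return num_params * BYTES_PER_PARAM[precision] / 1e9


TOTAL_PARAMS = 30e9  # "30B total" from the spec table

bf16_gb = weight_memory_gb(TOTAL_PARAMS, "bf16")
four_bit_gb = weight_memory_gb(TOTAL_PARAMS, "4bit")

print(f"BF16 weights:  ~{bf16_gb:.0f} GB")
print(f"4-bit weights: ~{four_bit_gb:.0f} GB")
```

This gives about 60 GB at BF16 and about 15 GB at 4 bits. The listed ~62 GB and ~21 GB sit above those weights-only numbers. The gap presumably covers runtime overhead and, for NVFP4, scale factors or layers kept at higher precision, though the page does not say how its figures were derived.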

📊 Benchmarks

OCRBench V2: 67.04
Video-MME: 72.2
OSWorld: 47.4
Speech IF: 89.39


Nemotron 3 Nano Omni is NVIDIA’s open omni-modal model: it accepts text, image, audio, and video input and produces text output, all in a single 30B-A3B mixture-of-experts (MoE) that activates only ~3B of its 30B parameters per token. It is a Mamba-Transformer hybrid that runs on one high-end GPU, and its weights are open under the NVIDIA Open Model Agreement, which allows commercial use.
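To illustrate what "activates only ~3B parameters per token" means in a mixture-of-experts layer, here is a toy sketch of generic top-k expert routing. It is not NVIDIA's router: the expert count, the value of `k`, the scalar "experts", and the function names `top_k_route` and `moe_forward` are all hypothetical, chosen only to show that each token runs through a small subset of the experts.

```python
import math
import random


def top_k_route(router_logits, k):
    """Pick the k highest-scoring experts and softmax-normalize their weights."""
    top = sorted(range(len(router_logits)),
                 key=lambda i: router_logits[i], reverse=True)[:k]
    peak = max(router_logits[i] for i in top)  # subtract max for stability
    exps = [math.exp(router_logits[i] - peak) for i in top]
    total = sum(exps)
    return [(i, e / total) for i, e in zip(top, exps)]


def moe_forward(x, experts, router_logits, k):
    """Weighted sum of the outputs of the k selected experts only."""
    routes = top_k_route(router_logits, k)
    y = sum(weight * experts[i](x) for i, weight in routes)
    return y, routes


# Hypothetical toy setup: 8 scalar "experts", 2 active per token.
NUM_EXPERTS, K = 8, 2
experts = [lambda x, scale=scale: scale * x
           for scale in range(1, NUM_EXPERTS + 1)]

random.seed(0)
logits = [random.gauss(0, 1) for _ in range(NUM_EXPERTS)]  # stand-in router scores

y, routes = moe_forward(1.0, experts, logits, K)
print("active experts:", [i for i, _ in routes])
print("fraction of experts used:", K / NUM_EXPERTS)
```

In this toy, only 2 of 8 experts run per token. The spec's 30B total / ~3B active figure implies that roughly a tenth of the parameters are used for any given token, which is why compute per token is closer to a 3B model even though all 30B weights must still fit in memory.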
