Nemotron Vision-Language Collection Image-text paired datasets for building vision-language models (VLMs). • 3 items • Updated 3 days ago • 8
view article Article The First Healthcare Robotics Dataset and Foundational Physical AI Models for Healthcare Robotics nvidia • Mar 16 • 30
view article Article The Open Evaluation Standard: Benchmarking NVIDIA Nemotron 3 Nano with NeMo Evaluator nvidia • Dec 17, 2025 • 50
view article Article Introducing SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding nvidia • Mar 19 • 47
Nemotron-Post-Training-v3 Collection Collection of datasets used in the post-training phase of Nemotron Nano and Super v3. • 28 items • Updated 3 days ago • 142
Nemotron-Pre-Training-Datasets Collection Large scale pre-training datasets used in the Nemotron family of models. • 12 items • Updated 3 days ago • 151
Nemotron-Personas Collection A collection of multilingual, region-specific synthetic persona datasets that support sovereign AI development across many countries and regions. • 7 items • Updated 3 days ago • 46
Nemotron RAG Collection Set of tools to build retrieval-augmented generation (RAG) systems, improve search and ranking accuracy, and extract structured data from complex docs • 10 items • Updated 3 days ago • 93
Physical AI Collection Collection of open, commercial-grade datasets for physical AI developers • 50 items • Updated 3 days ago • 160
view article Article Nemotron-Personas-India: Synthesized Data for Sovereign AI nvidia • Oct 13, 2025 • 14