How to use nvidia/omnivinci with Transformers:
    # Load model directly
    from transformers import AutoModel
    model = AutoModel.from_pretrained("nvidia/omnivinci", dtype="auto")
Can this run on a Jetson AGX Orin 32GB?
#6
by SRai22 - opened
Trying to understand if this model can be loaded for inference on a Jetson AGX Orin?
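For anyone sizing this up before trying it, here is a rough back-of-the-envelope sketch of whether the weights alone would fit. The parameter count below is a placeholder assumption, not a figure from this thread, and the 30% headroom is an arbitrary guess to leave room for activations, the KV cache, and the OS, since the Jetson's memory is shared between CPU and GPU. Both helper functions are hypothetical, written only for illustration.

```python
def estimate_weight_memory_gib(num_params: float, bytes_per_param: int = 2) -> float:
    """Rough size of the model weights alone, in GiB (2 bytes/param for BF16/FP16)."""
    return num_params * bytes_per_param / 2**30


def fits_in_memory(num_params: float, total_gib: float,
                   bytes_per_param: int = 2, headroom_fraction: float = 0.3) -> bool:
    """True if the weights fit after reserving a fraction of memory as headroom.

    headroom_fraction is an assumed safety margin, not a measured value.
    """
    usable_gib = total_gib * (1.0 - headroom_fraction)
    return estimate_weight_memory_gib(num_params, bytes_per_param) <= usable_gib


# ASSUMPTION: placeholder parameter count; check the model card for the real number.
ASSUMED_NUM_PARAMS = 9e9

if __name__ == "__main__":
    weights_gib = estimate_weight_memory_gib(ASSUMED_NUM_PARAMS)
    print(f"Estimated weights: {weights_gib:.1f} GiB")
    print("Fits in 32 GiB with headroom:", fits_in_memory(ASSUMED_NUM_PARAMS, 32))
```

This only bounds the weights; actual peak usage depends on input size and the runtime, so treat a "fits" result as a reason to try it rather than a guarantee.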
Yes, it can. But it takes about 15 seconds per image on average to run inference.
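If you want to reproduce a per-image average like this on your own device, a minimal timing sketch could look like the following. `average_latency_seconds` and `infer_fn` are hypothetical names: `infer_fn` stands in for whatever callable runs the model on one image, and it is assumed to block until the result is ready (on a GPU you would typically need to synchronize inside it, otherwise the timings will be too optimistic).

```python
import time
from statistics import mean


def average_latency_seconds(infer_fn, inputs, warmup: int = 1) -> float:
    """Average wall-clock seconds per call of infer_fn over inputs.

    The first `warmup` inputs are run once untimed so that one-off costs
    (lazy initialization, caches) do not skew the average.
    """
    items = list(inputs)
    if not items:
        raise ValueError("inputs must contain at least one item")

    # Untimed warm-up calls.
    for item in items[:warmup]:
        infer_fn(item)

    # Timed calls over every input.
    timings = []
    for item in items:
        start = time.perf_counter()
        infer_fn(item)
        timings.append(time.perf_counter() - start)
    return mean(timings)


if __name__ == "__main__":
    # Stand-in for real inference: sleeps briefly instead of running a model.
    def fake_infer(_image):
        time.sleep(0.01)

    print(f"{average_latency_seconds(fake_infer, range(5)):.3f} s per image")
```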
Thank you @SRai22 for the follow-up. We did some inference optimization for https://huggingface.co/nvidia/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-BF16