Video-Text-to-Text
Transformers
Safetensors
qwen2_5_omni
multimodal
video-understanding
audio-understanding
streaming
real-time
omni-modal
Instructions to use EurekaTian/ROMA with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use EurekaTian/ROMA with Transformers:
# Load model directly from transformers import AutoProcessor, AutoModel processor = AutoProcessor.from_pretrained("EurekaTian/ROMA") model = AutoModel.from_pretrained("EurekaTian/ROMA") - Notebooks
- Google Colab
- Kaggle