Hugging Face
Cyber-Blacat's Collections
Vedio
2D-generation
LLM
3D
Ultimate-HQ-Datasets
Function-Space
Vision
Multimodal
Sound(ASR+TTS)

Multimodal

updated 1 day ago

  • BAAI/Emu3.5

    Any-to-Any • 34B • Updated Dec 25, 2025 • 308 • 171

  • HIT-TMG/Uni-MoE-2.0-Omni

    Any-to-Any • 33B • Updated Nov 24, 2025 • 57 • 36

  • HIT-TMG/Uni-MoE-2.0-Image

    Text-to-Image • 31B • Updated Nov 23, 2025 • 171 • 4

  • Yuanshi/ViBT

    Any-to-Any • Updated Dec 7, 2025 • 63 • 19

  • inclusionAI/LLaDA2.0-flash

    103B • Updated Dec 19, 2025 • 545 • 69

  • LiquidAI/LFM2.5-1.2B-Instruct

    Text Generation • 1B • Updated Mar 30 • 424k • 593

  • Glanty/Capybara

    Any-to-Any • Updated Feb 27 • 232

  • google/gemma-4-31B-it

    Image-Text-to-Text • 33B • Updated about 9 hours ago • 10.9M • 2.79k

  • facebook/tribev2

    Updated Mar 27 • 205k • 547

  • nyu-visionx/Scale-RAE-Qwen1.5B_DiT2.4B

    Text Generation • 4B • Updated Jan 8 • 702