Running on Zero Agents Featured 1.91k Qwen3-TTS Demo π 1.91k Generate speech audio from text with custom or cloned voices
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition β’ 6B β’ Updated Dec 10, 2025 β’ 375k β’ 1.6k
Running on CPU Upgrade Agents Featured 1.33k Open ASR Leaderboard π 1.33k Explore and compare speechβrecognition model benchmarks
nguyenvulebinh/AV-HuBERT-MuAViC-multilingual Text Generation β’ 0.4B β’ Updated Mar 6, 2025 β’ 46 β’ 2
Zero-AVSR: Zero-Shot Audio-Visual Speech Recognition with LLMs by Learning Language-Agnostic Speech Representations Paper β’ 2503.06273 β’ Published Mar 8, 2025 β’ 6
Running 341 LLM Embeddings Explained: A Visual and Intuitive Guide π 341 How Language Models Turn Text into Meaning, From Traditional
Running on Zero Agents 23 Mitsua Likes Demo π 23 Text-to-Image Diffusion Model trained on licensed/pd data