pyannote/speaker-diarization-3.1
Automatic Speech Recognition β’ Updated β’ 10.2M β’ 1.77k
Detect and visualize human poses in images and videos
Generate speech from text using a reference voice
β¨[With v1.0.0] Accelerated TTS on Kokoro-82M
Fast, efficient, & multilingual text-to-speech
Generate spoken audio from text using Edge TTS
Efficient, fast, and natural text to speech with StyleTTS 2!
High-fidelity Text-To-Speech
Generate realistic audio from text
Generate spoken audio from text using selectable voices
Vote on the latest TTS models!
Transcribe audio files into text
High-quality speech synthesis powered by Kokoro TTS