How to use Supertone/supertonic with Supertonic:
```python
from supertonic import TTS

tts = TTS(auto_download=True)
style = tts.get_voice_style(voice_name="M1")
text = "The train delay was announced at 4:45 PM on Wed, Apr 3, 2024 due to track maintenance."
wav, duration = tts.synthesize(text, voice_style=style)
tts.save_audio(wav, "output.wav")
```
Alignment output?
#11
by alistim - opened
Really awesome work, folks! Wondering if you have any plans for character or word-level alignment outputs? Would be helpful for supporting interruptions in my AI voice agent context.
I'd love to know which letter or word was spoken at what time relative to the start of speech, rather than just using linear interpolation.
Hello! Since Supertonic is not based on phoneme-level duration modeling, providing character or word-level alignments is not straightforward at this time. We appreciate the suggestion and will keep this feature in mind for future consideration.