How to use kazusam/kt with ESPnet:
from espnet2.bin.tts_inference import Text2Speech model = Text2Speech.from_pretrained("kazusam/kt") speech, *_ = model("text to generate speech from")