doof-ferb/vlsp2020_vinai_100h
Viewer • Updated • 56.4k • 959 • 13
How to use Ducnt17/ChunkFormer with ESPnet:
from espnet2.bin.asr_inference import Speech2Text
model = Speech2Text.from_pretrained(
"Ducnt17/ChunkFormer"
)
speech, rate = soundfile.read("speech.wav")
text, *_ = model(speech)[0]Base model
nvidia/stt_ru_conformer_ctc_large