Dataset
keithito/lj_speech
How to use Asim037/wav22vec22-stt with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("automatic-speech-recognition", model="Asim037/wav22vec22-stt")

# Load model directly
from transformers import AutoProcessor, AutoModelForCTC
processor = AutoProcessor.from_pretrained("Asim037/wav22vec22-stt")
model = AutoModelForCTC.from_pretrained("Asim037/wav22vec22-stt")

This model is a fine-tuned version of facebook/wav2vec2-large-960h on the lj_speech dataset. It achieves the following results on the evaluation set:
More information needed
The following hyperparameters were used during training:

Training results
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 1658.3287 | 2.8571 | 50 | 239.6553 |
| 335.9539 | 5.7143 | 100 | 214.7313 |
| 279.8806 | 8.5714 | 150 | 209.9573 |
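
To make the table easier to read, the sketch below does some simple arithmetic on its three rows: the number of logged steps per epoch implied by the Step and Epoch columns, and the relative drop in each loss between the first and last evaluation. The helper names (`steps_per_epoch`, `relative_drop_percent`) are illustrative and not part of the model card; batch size, gradient accumulation and the loss reduction mode are not stated in the card, so no conclusions about them are drawn here.

```python
# Rows copied from the table above:
# (training loss, epoch, step, validation loss)
rows = [
    (1658.3287, 2.8571, 50, 239.6553),
    (335.9539, 5.7143, 100, 214.7313),
    (279.8806, 8.5714, 150, 209.9573),
]


def steps_per_epoch(rows):
    """Average of step / epoch over all rows (hypothetical helper)."""
    ratios = [step / epoch for _, epoch, step, _ in rows]
    return sum(ratios) / len(ratios)


def relative_drop_percent(first, last):
    """Percentage decrease from `first` to `last`."""
    return 100.0 * (first - last) / first


spe = steps_per_epoch(rows)
train_drop = relative_drop_percent(rows[0][0], rows[-1][0])
val_drop = relative_drop_percent(rows[0][3], rows[-1][3])

print(f"steps per epoch (implied): {spe:.1f}")
print(f"training loss drop, step 50 -> 150: {train_drop:.1f}%")
print(f"validation loss drop, step 50 -> 150: {val_drop:.1f}%")
```

On these numbers the training loss falls much faster than the validation loss (roughly 83% versus roughly 12% over the same interval), and most of the validation improvement happens between steps 50 and 100.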
Base model
facebook/wav2vec2-large-960h
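
The "Load model directly" snippet above loads the checkpoint with `AutoModelForCTC`, which means the model emits one score vector per audio frame and the text is recovered by CTC decoding. As a rough illustration of that last step, here is a minimal, dependency-free sketch of greedy CTC decoding: take the best token per frame, collapse consecutive repeats, then drop the blank token. The toy vocabulary, the blank index `0`, the `|` word delimiter and the helper names are assumptions for the example; the actual vocabulary of this checkpoint is not shown in the card, and in practice the processor's own decoding method would be used instead.

```python
def ctc_greedy_decode(frame_scores, vocab, blank_id=0, word_delimiter="|"):
    """Greedy CTC decoding over a list of per-frame score lists.

    `vocab`, `blank_id` and `word_delimiter` are illustrative assumptions,
    not values read from the actual checkpoint.
    """
    # 1. Pick the highest-scoring token id in every frame.
    best_ids = [
        max(range(len(scores)), key=scores.__getitem__)
        for scores in frame_scores
    ]

    # 2. Collapse consecutive repeats, then 3. skip the blank token.
    tokens = []
    previous = None
    for idx in best_ids:
        if idx != previous and idx != blank_id:
            tokens.append(vocab[idx])
        previous = idx

    # 4. Turn the word delimiter into spaces.
    return "".join(tokens).replace(word_delimiter, " ").strip()


def one_hot(index, size):
    """Build a toy score vector whose maximum is at `index`."""
    return [1.0 if i == index else 0.0 for i in range(size)]


# Toy vocabulary: blank, word delimiter, and two letters.
toy_vocab = ["<pad>", "|", "H", "I"]

# Frame-level predictions: H H <pad> I I | <pad> H I
frame_ids = [2, 2, 0, 3, 3, 1, 0, 2, 3]
toy_scores = [one_hot(i, len(toy_vocab)) for i in frame_ids]

decoded = ctc_greedy_decode(toy_scores, toy_vocab)
print(decoded)
```

The blank token is what lets CTC represent doubled letters: `H <pad> H` decodes to `HH`, whereas `H H` collapses to a single `H`.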