whisper.cpp / README.md
jiwook-mobilint's picture
Upload README.md with huggingface_hub
f810329 verified
|
Raw
History Blame Contribute Delete
2.23 kB
metadata
license: apache-2.0
base_model: openai/whisper-small
tags:
  - whisper
  - automatic-speech-recognition
  - ggml
  - mobilint
  - npu
language:
  - en
  - zh
  - de
  - es
  - ru
  - ko
  - fr
  - ja
  - pt
  - tr
  - pl
  - multilingual
library_name: whisper.cpp

whisper.cpp Models for Mobilint NPU

This repository provides all model files needed to run whisper.cpp-mblt, the Mobilint NPU-accelerated fork of whisper.cpp.

Available Files

Model File Size Description
whisper-small ggml-small.bin 466 MB GGML model (tokenizer + weights for CPU fallback)
whisper-small ggml-small-encoder.mxq 93 MB Mobilint NPU encoder
whisper-small ggml-small-decoder.mxq 159 MB Mobilint NPU decoder

Usage

NPU Inference (Mobilint)

# Download all files and run
whisper-cli-mblt \
  -m ggml-small.bin \
  --mxq-encoder ggml-small-encoder.mxq \
  --mxq-decoder ggml-small-decoder.mxq \
  -f audio.wav

# Or auto-download from HuggingFace
whisper-cli-mblt -hf mobilint/whisper-small -f audio.wav

CPU Inference (standard whisper.cpp)

The ggml-small.bin file is also compatible with standard whisper.cpp for CPU-only inference:

whisper-cli -m ggml-small.bin -f audio.wav

Model Details

  • Base model: openai/whisper-small (244M parameters)
  • Languages: 99 languages supported (English, Chinese, German, Spanish, Russian, Korean, French, Japanese, Portuguese, Turkish, Polish, and more)
  • Tasks: Transcription and translation (to English)
  • NPU pipeline: Audio → mel spectrogram (CPU) → encoder (NPU, global4) → decoder (NPU, single core, greedy) → text

Related Repositories

License

Apache 2.0 (same as the original OpenAI Whisper model)