| --- |
| license: apache-2.0 |
| base_model: openai/whisper-small |
| tags: |
| - whisper |
| - automatic-speech-recognition |
| - ggml |
| - mobilint |
| - npu |
| language: |
| - en |
| - zh |
| - de |
| - es |
| - ru |
| - ko |
| - fr |
| - ja |
| - pt |
| - tr |
| - pl |
| - multilingual |
| library_name: whisper.cpp |
| --- |
| |
| # whisper.cpp Models for Mobilint NPU |
|
|
| This repository provides all model files needed to run [whisper.cpp-mblt](https://git.mobilint.com/algorithm-team/integration/whisper.cpp-mblt), the Mobilint NPU-accelerated fork of [whisper.cpp](https://github.com/ggml-org/whisper.cpp). |
|
|
| ## Available Files |
|
|
| | Model | File | Size | Description | |
| |-------|------|------|-------------| |
| | whisper-small | `ggml-small.bin` | 466 MB | GGML model (tokenizer + weights for CPU fallback) | |
| | whisper-small | `ggml-small-encoder.mxq` | 93 MB | Mobilint NPU encoder | |
| | whisper-small | `ggml-small-decoder.mxq` | 159 MB | Mobilint NPU decoder | |
|
|
| ## Usage |
|
|
| ### NPU Inference (Mobilint) |
|
|
| ```bash |
| # Download all files and run |
| whisper-cli-mblt \ |
| -m ggml-small.bin \ |
| --mxq-encoder ggml-small-encoder.mxq \ |
| --mxq-decoder ggml-small-decoder.mxq \ |
| -f audio.wav |
| |
| # Or auto-download from HuggingFace |
| whisper-cli-mblt -hf mobilint/whisper-small -f audio.wav |
| ``` |
|
|
| ### CPU Inference (standard whisper.cpp) |
|
|
| The `ggml-small.bin` file is also compatible with standard whisper.cpp for CPU-only inference: |
|
|
| ```bash |
| whisper-cli -m ggml-small.bin -f audio.wav |
| ``` |
|
|
| ## Model Details |
|
|
| - **Base model**: [openai/whisper-small](https://huggingface.co/openai/whisper-small) (244M parameters) |
| - **Languages**: 99 languages supported (English, Chinese, German, Spanish, Russian, Korean, French, Japanese, Portuguese, Turkish, Polish, and more) |
| - **Tasks**: Transcription and translation (to English) |
| - **NPU pipeline**: Audio β mel spectrogram (CPU) β encoder (NPU, global4) β decoder (NPU, single core, greedy) β text |
|
|
| ## Related Repositories |
|
|
| - **[mobilint/whisper-small](https://huggingface.co/mobilint/whisper-small)** β Original Mobilint whisper-small model with config.json and tokenizer files |
| - **[ggerganov/whisper.cpp](https://huggingface.co/ggerganov/whisper.cpp)** β Upstream GGML models for CPU inference |
|
|
| ## License |
|
|
| Apache 2.0 (same as the original OpenAI Whisper model) |
|
|