| --- |
| license: apache-2.0 |
| language: |
| - zh |
| base_model: |
| - hfl/chinese-lert-base |
| tags: |
| - punctuation-restoration |
| --- |
| <div align="center"> |
| <h1>FireRedChat-punc</h1> |
| </div> |
| <div align="center"> |
| <a href="https://fireredteam.github.io/demos/firered_chat/">Demo</a> • |
| <a href="https://arxiv.org/pdf/2509.06502">FireRedChat Paper</a> • |
| <a href="https://huggingface.co/FireRedTeam">Huggingface</a> |
| </div> |
|
|
| ## Descriptions |
| FireRedChat-punc is a fine-tuned `hfl/chinese-lert-base` model designed for punctuation restoration, primarily for post-processing in [FireRedASR](https://github.com/FireRedTeam/FireRedASR). |
|
|
| The model restores the following punctuation marks: [, 。 ? !]. It supports both Chinese and English text, enhancing the readability of transcribed text. |
|
|
| ## Roadmap |
| - [x] 2025/09 |
| - [x] Release the fine-tuned punctuation restoration model. |
|
|
| ## Usage |
|
|
| RedPost source code [github](https://github.com/FireRedTeam/FireRedChat/tree/main/fireredasr-server/server/redpost) |
| Below is an example of how to use the FireRedChat-punc model for punctuation restoration: |
|
|
| ```bash |
| git clone https://huggingface.co/hfl/chinese-lert-base FireRedChat-punc/chinese-lert-base |
| ``` |
|
|
| ```python |
| import os |
| from redpost import RedPost, RedPostConfig |
| |
| punc_model_dir = "./FireRedChat-punc" |
| post_config = RedPostConfig( |
| use_gpu=True, |
| sentence_max_length=30 |
| ) |
| post_model = RedPost.from_pretrained(punc_model_dir, post_config) |
| batch_post_results = post_model.process([text], ["text"]) |
| text = "".join([r["punc_text"] for r in batch_post_results]) |
| text = re.sub("<unk>|<UNK>|\[unk\]|\[UNK\]", "", text) |
| print(text) |
| ``` |
|
|
| ## Use with FireRedASR |
|
|
| This punctuation restoration model can be used together with FireRedASR, refer to [fireredasr-server](https://github.com/FireRedTeam/FireRedChat/blob/main/fireredasr-server/README.md) for setup instructions. |
| https://github.com/FireRedTeam/FireRedChat/tree/main/fireredasr-server |
|
|
| ## License |
| The model and source code are licensed under the Apache-2.0 license. |
|
|
| ### Acknowledgment |
| - Base model: `hfl/chinese-lert-base` (license: apache-2.0) |
| - Designed for integration with [FireRedASR](https://github.com/FireRedTeam/FireRedASR). |