Instructions to use zai-org/chatglm-6b-int8 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use zai-org/chatglm-6b-int8 with Transformers:
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("zai-org/chatglm-6b-int8", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
Adding `safetensors` variant of this model
#2 opened over 1 year ago
by
SFconvertbot
推理相比量化前更慢了
2
#1 opened almost 3 years ago
by
luobotaxinghu