Text Generation
Transformers
Safetensors
English
mistral
Eval Results (legacy)
text-generation-inference
Instructions to use cognAI/lil-c3po with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use cognAI/lil-c3po with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="cognAI/lil-c3po")# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("cognAI/lil-c3po") model = AutoModelForCausalLM.from_pretrained("cognAI/lil-c3po") - Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- vLLM
How to use cognAI/lil-c3po with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "cognAI/lil-c3po" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "cognAI/lil-c3po", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker
docker model run hf.co/cognAI/lil-c3po
- SGLang
How to use cognAI/lil-c3po with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "cognAI/lil-c3po" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "cognAI/lil-c3po", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "cognAI/lil-c3po" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "cognAI/lil-c3po", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }' - Docker Model Runner
How to use cognAI/lil-c3po with Docker Model Runner:
docker model run hf.co/cognAI/lil-c3po
error in tokenizer
#2
by kz919 - opened
File "/import/snvm-sc-scratch1/kaizhaol/miniconda3/envs/coe/lib/python3.10/site-packages/sentencepiece/init.py", line 1038, in _func
raise IndexError('piece id is out of range.')
IndexError: piece id is out of range.
Hi @kz919 , thanks for bringing this to our attention.
Could you please share more information for us to replicate the error?
Thanks.
We are closing this report for now but if you have any updates on it, please feel free to re-open it or reach out to us at research@deepnight.tech
deepnight-research changed discussion status to closed