Text Generation
Safetensors
English
Chinese
qwen3
reward-model
rlhf
principle-following
qwen
conversational
Instructions to use WisdomShell/RewardAnything-8B-v1 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Inference
| { | |
| "_from_model_config": true, | |
| "eos_token_id": 151645, | |
| "pad_token_id": 151643, | |
| "transformers_version": "4.51.3" | |
| } | |