Qwen/Qwen3-VL-8B-Thinking — text-only (Qwen3 causal LM)

Text-only checkpoint extracted from the language-model tower of Qwen/Qwen3-VL-8B-Thinking and rewritten as a standard model_type: qwen3 / Qwen3ForCausalLM checkpoint: the vision and aligner modules are dropped and model.language_model.* keys are renamed to model.*.

Intended as a text-SFT base so trainers instantiate a plain Qwen3 causal LM instead of the full Qwen3VLForConditionalGeneration class.

Built with the deepswe converters sft/qwen3/scripts/prepare_qwen3_vl_text_checkpoint.py + save_qwen3_vl_text_as_qwen3_checkpoint.py.

Downloads last month: 12

Safetensors

Model size

8B params

Tensor type

BF16

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for eewer/qwen3-vl-8b-thinking-text

Base model

Qwen/Qwen3-VL-8B-Thinking

Finetuned

(66)

this model