hill_8k_300

This repository contains the global_step_300 actor checkpoint from the HiLL 8k run, converted from verl FSDP shards to Hugging Face Transformers format.

Source

  • Base model: meta-llama/Llama-3.2-3B-Instruct
  • Training run: HiLL-Llama-3.2-3B-Instruct-8k
  • Checkpoint: global_step_300/actor
  • Conversion backend: verl FSDP model merger

Loading

from transformers import AutoModelForCausalLM, AutoTokenizer
import torch

model_id = "sagnikM/hill_8k_300"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, dtype=torch.bfloat16)
Downloads last month
1,553
Safetensors
Model size
4B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for sagnikM/hill_8k_300

Finetuned
(1674)
this model