Commit History

Add generation_config.json (eos_token_id=7) for Transformers.js
f954820
verified

shreyask committed on

Set kv_cache_dtype + torch_dtype to float32 (matches actual ONNX weights)
c167f01
verified

shreyask committed on

Drop dangling model.onnx ref from use_external_data_format
c524ad2
verified

shreyask committed on

Patch ONNX layout + config for Transformers.js (onnx/model_q4)
aabf777
verified

shreyask committed on

Upload config.json
ee6691d
verified

shreyask committed on

Upload tokenizer_config.json
450ce47
verified

shreyask committed on

Upload decoder ONNX model (int4) via onnxruntime-genai
9181100
verified

shreyask committed on

initial commit
b87fba2
verified

shreyask committed on