We just compressed Qwen 3.6's KV cache 4x with zero quality loss (PPL actually improves slightly).
Works automatically on the hybrid architecture: it detects standard vs. linear attention layers.
Model card: huggingface.co/fraQtl/Qwen3.6-35B-A3B-fraQtl-kv :)
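For illustration, a rough sketch of what that layer detection could look like; the `split_layers` helper and the toy layout below are hypothetical, not fraQtl's actual code:

```python
# Hypothetical sketch: partition a hybrid model's layers before
# KV-cache compression. Only standard-attention layers hold a KV
# cache worth compressing; linear-attention layers are skipped.

def split_layers(layer_types):
    """Return (standard_indices, linear_indices) for a hybrid layout."""
    standard = [i for i, t in enumerate(layer_types) if t == "standard"]
    linear = [i for i, t in enumerate(layer_types) if t == "linear"]
    return standard, linear

# Toy hybrid layout (illustrative only, not the real Qwen3.6 layout)
layout = ["linear", "linear", "standard", "linear", "linear", "standard"]
std, lin = split_layers(layout)
print(std)  # indices of layers eligible for KV-cache compression
```

In a real checkpoint the layer types would come from the model config rather than a hand-written list.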