Saurabh_Sharma

SaurabhSharma220

AI & ML interests

None yet

Recent Activity

upvoted a collection 15 days ago

Gemma 4 Assistant GGUF

upvoted a collection 15 days ago

Qwen3-ASR

published a model 9 months ago

SaurabhSharma220/DPO_TinyLlama-1.1B-Chat-v1.0

upvoted 2 collections 15 days ago

Gemma 4 Assistant GGUF

Collection

Gemma 4 MTP assistant drafters as GGUF (F16/Q8_0/Q5_K_M/Q4_K_M/Q4_K_S). Speculative-decoding heads for the atomic-llama-cpp-turboquant fork. • 4 items • Updated 22 days ago • 11

Qwen3-ASR

Collection

4 items • Updated Jan 29 • 66

upvoted a collection 10 months ago

Gemma 3 Release

Collection

28 items • Updated Mar 12 • 638

upvoted 4 articles 10 months ago

Article

Google releases Gemma 2 2B, ShieldGemma and Gemma Scope

Xenova, pcuenq, reach-vb, joaogante • Jul 31, 2024 • 60

Article

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

ybelkada, timdettmers, artidoro, sgugger, smangrul • May 24, 2023 • 180

Article

Parameter-Efficient Fine-Tuning using 🤗 PEFT

smangrul, sayakpaul • Feb 10, 2023 • 119

Article

Fine-Tuning Gemma Models in Hugging Face

svaibhav, alanwaketan, ybelkada, ArthurZ • Feb 23, 2024 • 46

upvoted 2 papers about 1 year ago

GAIA: a benchmark for General AI Assistants

Paper • 2311.12983 • Published Nov 21, 2023 • 249

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Paper • 2408.03314 • Published Aug 6, 2024 • 67
