Ran Zilberstein

RanZilberstein-Nvidia

AI & ML interests

None yet

Recent Activity

published an article about 2 months ago

Introducing SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding

liked a model 3 months ago

nvidia/DeepSeek-R1-NVFP4

liked a model 10 months ago

nvidia/Llama-3_3-Nemotron-Super-49B-v1_5-FP8


Organizations

published an article about 2 months ago

Article

Introducing SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding

nvidia • Mar 19 • 47

liked a model 3 months ago

nvidia/DeepSeek-R1-NVFP4

Text Generation • 397B • Updated Jun 6, 2025 • 9.63k • 278

liked 2 models 10 months ago

nvidia/Llama-3_3-Nemotron-Super-49B-v1_5-FP8

Text Generation • 50B • Updated Oct 15, 2025 • 147k • 28

nvidia/Llama-3_3-Nemotron-Super-49B-v1_5

Text Generation • 50B • Updated Oct 15, 2025 • 32.3k • 233

liked a model about 1 year ago

nvidia/Llama-3_1-Nemotron-Ultra-253B-v1

Text Generation • Updated Oct 15, 2025 • 1.78k • 347

authored a paper about 1 year ago

FFN Fusion: Rethinking Sequential Computation in Large Language Models

Paper • 2503.18908 • Published Mar 24, 2025 • 20

liked 2 models about 1 year ago

nvidia/Llama-3.1-Nemotron-Nano-8B-v1

Text Generation • 8B • Updated Oct 15, 2025 • 16.5k • 222

nvidia/Llama-3_3-Nemotron-Super-49B-v1

Text Generation • 50B • Updated Oct 15, 2025 • 46.7k • 323

authored a paper over 1 year ago

Puzzle: Distillation-Based NAS for Inference-Optimized LLMs

Paper • 2411.19146 • Published Nov 28, 2024 • 20

liked a model over 1 year ago

nvidia/Llama-3_1-Nemotron-51B-Instruct

Text Generation • Updated Jul 6, 2025 • 1.29k • 210

updated a Space almost 2 years ago

README

👀
