ldwang

ftgreat

AI & ML interests

LLM, MLLM, Infra

Recent Activity

liked a model about 3 hours ago

deepreinforce-ai/Ornith-1.0-397B

upvoted a paper 5 days ago

VibeThinker-3B: Exploring the Frontier of Verifiable Reasoning in Small Language Models

upvoted a paper 11 days ago

LoopCoder-v2: Only Loop Once for Efficient Test-Time Computation Scaling

View all activity

Organizations

upvoted a paper 5 days ago

VibeThinker-3B: Exploring the Frontier of Verifiable Reasoning in Small Language Models

Paper • 2606.16140 • Published 14 days ago • 119

upvoted a paper 11 days ago

LoopCoder-v2: Only Loop Once for Efficient Test-Time Computation Scaling

Paper • 2606.18023 • Published 13 days ago • 207

upvoted 2 articles 11 days ago

Article

Profiling in PyTorch (Part 2): From nn.Linear to a Fused MLP

ariG23498, ror, sergiopaniego, pcuenq, sayakpaul

•

18 days ago

• 49

Article

Profiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler

ariG23498, sayakpaul, sergiopaniego, ror, pcuenq

•

May 29

• 129

upvoted a paper 16 days ago

MiniMax Sparse Attention

Paper • 2606.13392 • Published 18 days ago • 148

upvoted a collection 24 days ago

Nemotron-Post-Training-v3

Collection

Collection of datasets used in the post-training phase of Nemotron Nano, Super, and Ultra v3. • 50 items • Updated 17 days ago • 168

upvoted an article 25 days ago

Article

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego

•

Mar 10

• 165

upvoted a collection 2 months ago

Qwen3.6

Collection

4 items • Updated Apr 22 • 420

upvoted 2 papers 3 months ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published Apr 14 • 113

Revisiting On-Policy Distillation: Empirical Failure Modes and Simple Fixes

Paper • 2603.25562 • Published Mar 26 • 19

upvoted an article 3 months ago

Article

Building Effective Agents with Anthropic’s Best Practices and smolagents ❤️

Sri-Vigneshwar-DJ

•

Jan 4, 2025

• 9

upvoted a collection 3 months ago

UltraData

Collection

Ultra Scale, Ultra Quality, Ultra Coverage • 11 items • Updated May 28 • 98

upvoted a paper 3 months ago

Data Science and Technology Towards AGI Part I: Tiered Data Management

Paper • 2602.09003 • Published Feb 9 • 8

upvoted a paper 4 months ago

Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

Paper • 2602.10604 • Published Feb 11 • 201

upvoted a collection 4 months ago

Open Coding Agents Specialization

Collection

Ai2 Open Coding Agents - Django, Sphinx, Sympy Data • 6 items • Updated Feb 11 • 6

upvoted 4 papers 4 months ago

upvoted an article 5 months ago

Article

Automated Discovery of High-Performance GPU Kernels with OpenEvolve

codelion

•

Jun 27, 2025

• 26

ldwang

AI & ML interests

Recent Activity

Organizations

ldwang's activity

Profiling in PyTorch (Part 2): From nn.Linear to a Fused MLP

Profiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

Building Effective Agents with Anthropic’s Best Practices and smolagents ❤️

Automated Discovery of High-Performance GPU Kernels with OpenEvolve