12 21

Genius Patrick

geniuspatrick

AI & ML interests

None yet

Recent Activity

liked a Space about 18 hours ago

HuggingFaceM4/encoder-free-vlm

liked a Space about 2 months ago

AdithyaSK/rl-environments-guide

liked a Space about 2 months ago

Xenova/the-tokenizer-playground

View all activity

Organizations

None yet

liked a Space about 18 hours ago

Encoder-Free VLM

👁

Train Your Own Encoder-Free VLM in $100

liked 2 Spaces about 2 months ago

The ultimate guide to RL environments: building and scaling them in the LLM era

📝

191

Building and scaling RL environments for LLM training

The Tokenizer Playground

📝

678

Experiment with and compare different tokenizers

upvoted an article 2 months ago

Article

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego

•

Mar 10

• 164

liked 2 Spaces 2 months ago

Unlocking On-Policy Distillation for Any Model Family

📝

114

Explore on-policy distillation visualization for any model

Distilling 100B+ Models 40x Faster with TRL

📝

TRL distillation for 100B+ teachers, 40x faster

upvoted an article 3 months ago

Article

Welcome Gemma 4: Frontier multimodal intelligence on device

merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift

•

Apr 2

• 909

liked a Space 3 months ago

Open VLM Leaderboard

🌎

1.02k

VLMEvalKit Evaluation Results Collection

upvoted 3 articles 3 months ago

Article

Introducing smolagents: simple agents that write actions in code.

m-ric, merve, thomwolf

•

Dec 31, 2024

• 1.2k

Article

Vision Language Models (Better, faster, stronger)

merve, sergiopaniego, ariG23498, pcuenq, andito

•

May 12, 2025

• 614

Article

视觉语言模型 (更好、更快、更强)

merve, sergiopaniego, ariG23498, pcuenq, andito

•

May 12, 2025

• 17

liked a model 3 months ago

HuggingFaceTB/qwen3-1.7b-gsm8k-sft

Text Generation • 2B • Updated Mar 25 • 539 • • 3

liked a Space 3 months ago

The Synthetic Data Playbook: Generating Trillions of the Finest Tokens

📝

261

Visualize synthetic‑data experiments as an interactive bookshelf

liked 3 datasets 3 months ago

upvoted a collection 3 months ago

Qwen3.5-Claude-4.6-Opus-Reasoning-Distilled

Collection

18 items • Updated May 23 • 212

liked a Space 4 months ago

QED-Nano: Teaching a Tiny Model to Prove Hard Theorems

📝

Who needs 1T parameters? Olympiad proofs with a 4B model

upvoted an article 4 months ago

Article

Mixture of Experts (MoEs) in Transformers

ariG23498, pcuenq, merve, IlyasMoutawwakil, ArthurZ, sergiopaniego, Molbap

•

Feb 26

• 169

upvoted an article 5 months ago

Article

Transformers v5: Simple model definitions powering the AI ecosystem

lysandre, ArthurZ, cyrilvallez, reach-vb

•

Dec 1, 2025

• 311

Genius Patrick

AI & ML interests

Recent Activity

Organizations

geniuspatrick's activity

Encoder-Free VLM

The ultimate guide to RL environments: building and scaling them in the LLM era

The Tokenizer Playground

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

Unlocking On-Policy Distillation for Any Model Family

Distilling 100B+ Models 40x Faster with TRL

Welcome Gemma 4: Frontier multimodal intelligence on device

Open VLM Leaderboard

Introducing smolagents: simple agents that write actions in code.

Vision Language Models (Better, faster, stronger)

视觉语言模型 (更好、更快、更强)

The Synthetic Data Playbook: Generating Trillions of the Finest Tokens

QED-Nano: Teaching a Tiny Model to Prove Hard Theorems

Mixture of Experts (MoEs) in Transformers

Transformers v5: Simple model definitions powering the AI ecosystem