view article Article You could have designed state of the art positional encoding FL33TW00D-HF • Nov 25, 2024 • 483
view article Article Continuous batching from first principles +1 ror, ArthurZ, mcpotato • Nov 25, 2025 • 402
view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 lysandre, ArthurZ, cyrilvallez, reach-vb • Dec 1, 2025 • 311
Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer Paper • 2511.22699 • Published Nov 27, 2025 • 246
view article Article What's going on with the Open LLM Leaderboard? +2 clefourrier, SaylorTwift, slippylolo, thomwolf • Jun 23, 2023 • 51
PixelDiT: Pixel Diffusion Transformers for Image Generation Paper • 2511.20645 • Published Nov 25, 2025 • 37
RELIC: Interactive Video World Model with Long-Horizon Memory Paper • 2512.04040 • Published Dec 3, 2025 • 24
Rethinking JEPA: Compute-Efficient Video SSL with Frozen Teachers Paper • 2509.24317 • Published Sep 29, 2025 • 11
view article Article Timm ❤️ Transformers: Use any timm model with transformers +3 ariG23498, rwightman, qubvel-hf, pcuenq, reach-vb • Jan 16, 2025 • 55
UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer Paper • 2504.11289 • Published Apr 15, 2025 • 2
view article Article 17 Reasons Why Gradio Isn't Just Another UI Library ysharma, abidlabs • Apr 16, 2025 • 44
VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models Paper • 2502.02492 • Published Feb 4, 2025 • 66
Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 674