A Comprehensive Survey of Mixture-of-Experts: Algorithms, Theory, and Applications Paper • 2503.07137 • Published Mar 10, 2025 • 1
AnyRecon: Arbitrary-View 3D Reconstruction with Video Diffusion Model Paper • 2604.19747 • Published 5 days ago • 37
LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model Paper • 2604.20796 • Published 4 days ago • 227
Bitnet.cpp: Efficient Edge Inference for Ternary LLMs Paper • 2502.11880 • Published Feb 17, 2025 • 18
PyTorch Distributed: Experiences on Accelerating Data Parallel Training Paper • 2006.15704 • Published Jun 28, 2020 • 8
Multi-module GRPO: Composing Policy Gradients and Prompt Optimization for Language Model Programs Paper • 2508.04660 • Published Aug 6, 2025 • 3
CoInteract: Physically-Consistent Human-Object Interaction Video Synthesis via Spatially-Structured Co-Generation Paper • 2604.19636 • Published 5 days ago • 82
MegaStyle: Constructing Diverse and Scalable Style Dataset via Consistent Text-to-Image Style Mapping Paper • 2604.08364 • Published 17 days ago • 100
MultiWorld: Scalable Multi-Agent Multi-View Video World Models Paper • 2604.18564 • Published 6 days ago • 41
Efficient Memory Management for Large Language Model Serving with PagedAttention Paper • 2309.06180 • Published Sep 12, 2023 • 54
Geometric Context Transformer for Streaming 3D Reconstruction Paper • 2604.14141 • Published 11 days ago • 18