mlx-community/deepseek-ai-DeepSeek-V4-Flash-8bit Text Generation • 284B • Updated 6 days ago • 18.9k • 11
LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model Paper • 2604.20796 • Published 9 days ago • 237
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published 29 days ago • 498
Do Audio-Visual Large Language Models Really See and Hear? Paper • 2604.02605 • Published 28 days ago • 7
PerceptionComp: A Video Benchmark for Complex Perception-Centric Reasoning Paper • 2603.26653 • Published Mar 27 • 18
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization Paper • 2603.19835 • Published Mar 20 • 349