Towards Verifiable Multimodal Deep Research: A Multi-Agent Harness for Interleaved Report Generation Paper • 2605.29861 • Published 2 days ago • 6
Skill0.5: Joint Skill Internalization and Utilization for Out-of-Distribution Generalization in Agentic Reinforcement Learning Paper • 2605.28424 • Published 3 days ago • 16
LoMo: Local Modality Substitution for Deeper Vision-Language Fusion Paper • 2605.30265 • Published 2 days ago • 17
How LoRA Remembers? A Parametric Memory Law for LLM Finetuning Paper • 2605.30260 • Published 2 days ago • 19
YoCausal: How Far is Video Generation from World Model? A Causality Perspective Paper • 2605.30346 • Published 2 days ago • 30
WorldMemArena: Evaluating Multimodal Agent Memory Through Action-World Interaction Paper • 2605.29341 • Published 2 days ago • 3
Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments Paper • 2605.30280 • Published 2 days ago • 68
minWM: A Full-Stack Open-Source Framework for Real-Time Interactive Video World Models Paper • 2605.30263 • Published 2 days ago • 37
LiteCoder-Terminal: Scaling Long-Horizon Terminal Environments for Learning Language Agents Paper • 2605.29559 • Published 2 days ago • 7
AgentDoG 1.5: A Lightweight and Scalable Alignment Framework for AI Agent Safety and Security Paper • 2605.29801 • Published 2 days ago • 80
OmniRetrieval: Unified Retrieval across Heterogeneous Knowledge Sources Paper • 2605.29250 • Published 2 days ago • 52
Verus-SpecGym: An Agentic Environment for Evaluating Specification Autoformalization Paper • 2605.26457 • Published 4 days ago • 3
OmniVerifier-M1: Multimodal Meta-Verifier with Explicit Structured Recalibration Paper • 2605.28805 • Published 3 days ago • 8
DenoiseRL: Bootstrapping Reasoning Models to Recover from Noisy Prefixes Paper • 2605.28421 • Published 3 days ago • 43
Learning to Act under Noise: Enhancing Agent Robustness via Noisy Environments Paper • 2605.27209 • Published 4 days ago • 11
AgentFugue: Agent Scaling for Long-Horizon Tasks through Collective Reasoning Paper • 2605.24486 • Published 7 days ago • 4
ESC-Skills: Discovering and Self-Evolving Skills for Emotional Support Conversations Paper • 2605.27908 • Published 3 days ago • 3
ProRL: Effective Reinforcement Learning for Proactive Recommendation via Rectified Policy Gradient Estimation Paper • 2605.28293 • Published 3 days ago • 78
Rethinking Memory as Continuously Evolving Connectivity Paper • 2605.28773 • Published 3 days ago • 22