SkillEvolBench: Benchmarking the Evolution from Episodic Experience to Procedural Skills Paper • 2605.24117 • Published 8 days ago • 17
CARPE: Context-Aware Image Representation Prioritization via Ensemble for Large Vision-Language Models Paper • 2601.13622 • Published Jan 20 • 1
Personalized RewardBench: Evaluating Reward Models with Human Aligned Personalization Paper • 2604.07343 • Published Apr 8 • 13
Learning to Self-Verify Makes Language Models Better Reasoners Paper • 2602.07594 • Published Feb 7 • 3
Collaborative Multi-Agent Optimization for Personalized Memory System Paper • 2603.12631 • Published Mar 13
Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning Paper • 2605.06130 • Published 23 days ago • 111
ModelLens: Finding the Best for Your Task from Myriads of Models Paper • 2605.07075 • Published 22 days ago • 15
ModelLens: Finding the Best for Your Task from Myriads of Models Paper • 2605.07075 • Published 22 days ago • 15
NTIRE 2025 XGC Quality Assessment Challenge: Methods and Results Paper • 2506.02875 • Published Jun 3, 2025
Trade-offs in Image Generation: How Do Different Dimensions Interact? Paper • 2507.22100 • Published Jul 29, 2025
OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation Paper • 2604.11804 • Published Apr 13 • 72
OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation Paper • 2604.11804 • Published Apr 13 • 72
Personalized RewardBench: Evaluating Reward Models with Human Aligned Personalization Paper • 2604.07343 • Published Apr 8 • 13
HiFi-Inpaint: Towards High-Fidelity Reference-Based Inpainting for Generating Detail-Preserving Human-Product Images Paper • 2603.02210 • Published Mar 2 • 29
HiFi-Inpaint: Towards High-Fidelity Reference-Based Inpainting for Generating Detail-Preserving Human-Product Images Paper • 2603.02210 • Published Mar 2 • 29
DSDR: Dual-Scale Diversity Regularization for Exploration in LLM Reasoning Paper • 2602.19895 • Published Feb 23 • 14
When to Memorize and When to Stop: Gated Recurrent Memory for Long-Context Reasoning Paper • 2602.10560 • Published Feb 11 • 31
A-RAG: Scaling Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces Paper • 2602.03442 • Published Feb 3 • 21
Reasoning-Enhanced Large Language Models for Molecular Property Prediction Paper • 2510.10248 • Published Oct 11, 2025 • 2