Measuring Epistemic Resilience of LLMs Under Misleading Medical Context Paper • 2606.12291 • Published 14 days ago • 58
Data Journalist Agent: Transforming Data into Verifiable Multimodal Stories Paper • 2606.11176 • Published 15 days ago • 125
The Alignment Curse: Modality Alignment Supercharges Audio Attacks via Text Transfer Paper • 2602.02557 • Published 26 days ago • 21
D^2-Monitor: Dynamic Safety Monitoring for Diffusion LLMs via Hesitation-Aware Routing Paper • 2605.25893 • Published about 1 month ago • 39
Forecasting Scientific Progress with Artificial Intelligence Paper • 2605.22681 • Published May 21 • 45
Reasoning via Video: The First Evaluation of Video Models' Reasoning Abilities through Maze-Solving Tasks Paper • 2511.15065 • Published Nov 19, 2025 • 78
DeepResearch Arena: The First Exam of LLMs' Research Abilities via Seminar-Grounded Tasks Paper • 2509.01396 • Published Sep 1, 2025 • 58
LLM4SR: A Survey on Large Language Models for Scientific Research Paper • 2501.04306 • Published Jan 8, 2025 • 35