arxiv:2604.14683
Qianqian Xie
mistletoe111
AI & ML interests
None yet
Recent Activity
upvoted a paper 4 days ago
DR^{3}-Eval: Towards Realistic and Reproducible Deep Research Evaluation
upvoted a paper 5 days ago
WebCompass: Towards Multimodal Web Coding Evaluation for Code Language Models
updated a dataset 5 days ago
NJU-LINK/DR3-Eval