WangWenxuan

Yummytanmo

Yummytanmo

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

EvoPolicyGym: Evaluating Autonomous Policy Evolution in Interactive Environments

upvoted a paper 25 days ago

ComBench: A Benchmark for Rigorous Proof Reasoning and Constructive Realization in Olympiad-Level Combinatorics

upvoted a paper 25 days ago

π-Bench: Evaluating Proactive Personal Assistant Agents in Long-Horizon Workflows

View all activity

Organizations

None yet

upvoted a paper 2 days ago

EvoPolicyGym: Evaluating Autonomous Policy Evolution in Interactive Environments

Paper • 2607.02440 • Published 4 days ago • 43

upvoted 3 papers 25 days ago

ComBench: A Benchmark for Rigorous Proof Reasoning and Constructive Realization in Olympiad-Level Combinatorics

Paper • 2606.10479 • Published 27 days ago • 19

π-Bench: Evaluating Proactive Personal Assistant Agents in Long-Horizon Workflows

Paper • 2605.14678 • Published May 19 • 108

SubtleMemory: A Benchmark for Fine-Grained Relational Memory Discrimination in Long-Horizon AI Agents

Paper • 2606.05761 • Published Jun 4 • 19

upvoted a paper about 2 months ago

Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling

Paper • 2605.13301 • Published May 13 • 165

upvoted a paper 5 months ago

AdaReasoner: Dynamic Tool Orchestration for Iterative Visual Reasoning

Paper • 2601.18631 • Published Jan 26 • 48

upvoted a paper 7 months ago

Flash-DMD: Towards High-Fidelity Few-Step Image Generation with Efficient Distillation and Joint Reinforcement Learning

Paper • 2511.20549 • Published Nov 25, 2025 • 27