2 18 4

Haoran Zhang

zzzhr97

AI & ML interests

Lange Language Models, Large Reasoning Models

Recent Activity

updated a dataset 5 days ago

zzzhr97/Pi-Bench

published a dataset 5 days ago

zzzhr97/Pi-Bench

upvoted a paper 8 days ago

π-Bench: Evaluating Proactive Personal Assistant Agents in Long-Horizon Workflows

View all activity

Organizations

updated a dataset 5 days ago

zzzhr97/Pi-Bench

Viewer • Updated 5 days ago • 100 • 68 • 1

published a dataset 5 days ago

zzzhr97/Pi-Bench

Viewer • Updated 5 days ago • 100 • 68 • 1

upvoted a paper 8 days ago

π-Bench: Evaluating Proactive Personal Assistant Agents in Long-Horizon Workflows

Paper • 2605.14678 • Published 11 days ago • 102

submitted a paper to Daily Papers 8 days ago

π-Bench: Evaluating Proactive Personal Assistant Agents in Long-Horizon Workflows

Paper • 2605.14678 • Published 11 days ago • 102

authored a paper 8 days ago

$π$-Bench: Evaluating Proactive Personal Assistant Agents in Long-Horizon Workflows

Paper • 2605.14678 • Published 11 days ago • 102

authored a paper 11 days ago

Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling

Paper • 2605.13301 • Published 17 days ago • 158

upvoted 2 papers 15 days ago

Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling

Paper • 2605.13301 • Published 17 days ago • 158

Teaching Thinking Models to Reason with Tools: A Full-Pipeline Recipe for Tool-Integrated Reasoning

Paper • 2605.06326 • Published 23 days ago • 26

upvoted a paper about 2 months ago

GEMS: Agent-Native Multimodal Generation with Memory and Skills

Paper • 2603.28088 • Published Mar 30 • 85

upvoted a paper 2 months ago

AI Can Learn Scientific Taste

Paper • 2603.14473 • Published Mar 15 • 427

upvoted a paper 4 months ago

P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads

Paper • 2602.09443 • Published Feb 10 • 59

updated a model 4 months ago

zzzhr97/TRM-8B

Text Classification • 8B • Updated Feb 10 • 13

published a model 4 months ago

zzzhr97/TRM-8B

Text Classification • 8B • Updated Feb 10 • 13

updated a dataset 4 months ago

zzzhr97/TRM-Preference

Updated Feb 10 • 5

published a dataset 4 months ago

zzzhr97/TRM-Preference

Updated Feb 10 • 5

updated a dataset 4 months ago

zzzhr97/WebInstruct-Verified-Processed

Viewer • Updated Feb 10 • 233k • 8

published a dataset 4 months ago

zzzhr97/WebInstruct-Verified-Processed

Viewer • Updated Feb 10 • 233k • 8

upvoted 3 papers 4 months ago

Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations

Paper • 2602.05885 • Published Feb 5 • 28

AdaReasoner: Dynamic Tool Orchestration for Iterative Visual Reasoning

Paper • 2601.18631 • Published Jan 26 • 48

Advances and Frontiers of LLM-based Issue Resolution in Software Engineering: A Comprehensive Survey

Paper • 2601.11655 • Published Jan 15 • 63

Haoran Zhang

AI & ML interests

Recent Activity

Organizations

zzzhr97's activity