7 10

Jimmy

bigheiniuJ

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

CUA-Gym: Scaling Verifiable Training Environments and Tasks for Computer-Use Agents

upvoted a paper 25 days ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

liked a dataset about 1 month ago

SALT-NLP/SWE-chat

View all activity

Organizations

upvoted a paper 1 day ago

CUA-Gym: Scaling Verifiable Training Environments and Tasks for Computer-Use Agents

Paper • 2605.25624 • Published 10 days ago • 32

upvoted a paper 25 days ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published Apr 14 • 109

liked a dataset about 1 month ago

SALT-NLP/SWE-chat

Viewer • Updated Apr 29 • 2.73M • 3.37k • 60

upvoted a collection 4 months ago

Agent World Model

Collection

4 items • Updated Feb 11 • 9

liked a dataset 4 months ago

miromind-ai/MiroVerse-v0.1

Viewer • Updated Jan 16 • 228k • 7.42k • 235

upvoted a paper about 1 year ago

QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning

Paper • 2505.17667 • Published May 23, 2025 • 89

upvoted a paper over 1 year ago

IHEval: Evaluating Language Models on Following the Instruction Hierarchy

Paper • 2502.08745 • Published Feb 12, 2025 • 20

liked a dataset over 1 year ago

mlabonne/chatml_dpo_pairs

Viewer • Updated Apr 11, 2024 • 12.9k • 84 • 55

updated 9 models over 1 year ago

updated 2 datasets over 1 year ago

bigheiniuJ/ultrafeedback_feedback_norandom

Viewer • Updated Dec 24, 2024 • 61.1k • 7

bigheiniuJ/ultrafeedback_feedback_noshort

Viewer • Updated Dec 24, 2024 • 61.1k • 11

updated a model over 1 year ago

bigheiniuJ/zephyr-7b-dpo-full-prompt-extend-chosen

Text Generation • 7B • Updated Dec 24, 2024 • 1

Jimmy

AI & ML interests

Recent Activity

Organizations

bigheiniuJ's activity