Mark

Makrrr

AI & ML interests

NLP, RLHF, IR

Recent Activity

upvoted a paper 3 days ago

Harness-1: Reinforcement Learning for Search Agents with State-Externalizing Harnesses

Paper • 2606.02373 • Published 5 days ago • 42

upvoted a paper 28 days ago

SkillOS: Learning Skill Curation for Self-Evolving Agents

Paper • 2605.06614 • Published 30 days ago • 46

updated a model about 1 month ago

CL-From-Nothing/Qwen3-4B-SSD-RLVE-Eval20-N20-global-step-500

Text Generation • 4B • Updated Apr 26 • 143

published a model about 1 month ago

CL-From-Nothing/Qwen3-4B-SSD-RLVE-Eval20-N20-global-step-500

Text Generation • 4B • Updated Apr 26 • 143

updated a dataset about 1 month ago

CL-From-Nothing/RLVE-Eval20-Qwen3-4B-SSD-N20-SFT-Train

Viewer • Updated Apr 26 • 16k • 29

published a dataset about 1 month ago

CL-From-Nothing/RLVE-Eval20-Qwen3-4B-SSD-N20-SFT-Train

Viewer • Updated Apr 26 • 16k • 29

updated a model about 1 month ago

CL-From-Nothing/Qwen3-1-7B-SSD-RLVE-Eval20-N20-global-step-500

Text Generation • 2B • Updated Apr 24 • 136

published a model about 1 month ago

CL-From-Nothing/Qwen3-1-7B-SSD-RLVE-Eval20-N20-global-step-500

Text Generation • 2B • Updated Apr 24 • 136

updated a dataset about 1 month ago

CL-From-Nothing/RLVE-Eval20-Qwen3-1.7B-SSD-N20-SFT-Train

Viewer • Updated Apr 24 • 16k • 41

published a dataset about 1 month ago

CL-From-Nothing/RLVE-Eval20-Qwen3-1.7B-SSD-N20-SFT-Train

Viewer • Updated Apr 24 • 16k • 41

updated a dataset about 2 months ago

CL-From-Nothing/rlve-eval20-qwen3-4b-n4-randcut512-4096x20-completed-by-qwen3-4b-thinking-r16384

Viewer • Updated Apr 21 • 64k • 42

published a dataset about 2 months ago

CL-From-Nothing/rlve-eval20-qwen3-4b-n4-randcut512-4096x20-completed-by-qwen3-4b-thinking-r16384

Viewer • Updated Apr 21 • 64k • 42

updated a dataset about 2 months ago

CL-From-Nothing/rlve-multitask-qwen3-4b-n4-randcut512-4096x20-completed-by-qwen3-4b-thinking-r16384

Viewer • Updated Apr 16 • 42.8k • 20

published a dataset about 2 months ago

CL-From-Nothing/rlve-multitask-qwen3-4b-n4-randcut512-4096x20-completed-by-qwen3-4b-thinking-r16384

Viewer • Updated Apr 16 • 42.8k • 20

updated a dataset about 2 months ago

CL-From-Nothing/rlve-multitask-qwen3-4b-rollouts-n4-tokens16384

Viewer • Updated Apr 16 • 3.2k • 11

published a dataset about 2 months ago

CL-From-Nothing/rlve-multitask-qwen3-4b-rollouts-n4-tokens16384

Viewer • Updated Apr 16 • 3.2k • 11

upvoted a paper about 2 months ago

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 506

updated a dataset about 2 months ago

CL-From-Nothing/FrozenLake-Hard-Trajectories

Viewer • Updated Apr 12 • 8k • 48

published a dataset about 2 months ago

CL-From-Nothing/FrozenLake-Hard-Trajectories

Viewer • Updated Apr 12 • 8k • 48

updated a dataset about 2 months ago

CL-From-Nothing/Sokoban-Trajectories

Viewer • Updated Apr 12 • 8k • 82
