Cola Chen (SII)'s picture

Cola Chen (SII)

141forever

·

https://141forever.github.io/

141forever

AI & ML interests

None yet

Recent Activity

upvoted a paper about 6 hours ago

TreeSeeker: Tree-Structured Trial, Error, and Return in Deep Search

upvoted a paper 1 day ago

POISE: Position-Aware Undetectable Skill Injection on LLM Agents

new activity 3 months ago

HuggingFaceH4/on-policy-distillation:How to reproduce the results in your blog?

View all activity

Organizations

upvoted a paper about 6 hours ago

TreeSeeker: Tree-Structured Trial, Error, and Return in Deep Search

Paper • 2606.11662 • Published 3 days ago • 9

upvoted a paper 1 day ago

POISE: Position-Aware Undetectable Skill Injection on LLM Agents

Paper • 2606.07943 • Published 7 days ago • 4

New activity in HuggingFaceH4/on-policy-distillation 3 months ago

How to reproduce the results in your blog?

#7 opened 4 months ago by

liked a Space 4 months ago

Unlocking On-Policy Distillation for Any Model Family

Explore on-policy distillation visualization for any model

New activity in HuggingFaceH4/on-policy-distillation 4 months ago

About lr and evaluation

#6 opened 5 months ago by

upvoted an article 7 months ago

Article

Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models

ServiceNow-AI

•

Nov 19, 2025

• 34

upvoted a paper 8 months ago

Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn LLM Agents

Paper • 2510.14967 • Published Oct 16, 2025 • 34