Embarrassingly Simple Self-Distillation Improves Code Generation Paper • 2604.01193 • Published 25 days ago • 46
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization Paper • 2603.19835 • Published Mar 20 • 343
Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs? Paper • 2603.24472 • Published Mar 25 • 54
Grounding World Simulation Models in a Real-World Metropolis Paper • 2603.15583 • Published Mar 16 • 153
Thinking to Recall: How Reasoning Unlocks Parametric Knowledge in LLMs Paper • 2603.09906 • Published Mar 10 • 75
IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse Paper • 2603.12201 • Published Mar 12 • 53
Does Your Reasoning Model Implicitly Know When to Stop Thinking? Paper • 2602.08354 • Published Feb 9 • 264
Weak-Driven Learning: How Weak Agents Make Strong Agents Stronger Paper • 2602.08222 • Published Feb 9 • 290