Cho Sung-min's picture

Cho Sung-min

smurpes

·

AI & ML interests

None yet

Recent Activity

liked a model about 23 hours ago

openai/clip-vit-base-patch32

liked a model 1 day ago

stabilityai/stable-diffusion-3-medium

upvoted a paper 2 days ago

SwanVoice: Expressive Long-Form Zero-Shot Speech Synthesis for Both Monologue and Dialogue

View all activity

Organizations

None yet

upvoted a paper 2 days ago

SwanVoice: Expressive Long-Form Zero-Shot Speech Synthesis for Both Monologue and Dialogue

Paper • 2605.30993 • Published 7 days ago • 56

upvoted a paper 3 days ago

Verifiable Rewards Beyond Math and Code: Lightweight Corpus-Grounded Process Supervision for Factual Question Answering

Paper • 2605.29648 • Published 8 days ago • 10

upvoted a paper 13 days ago

Draft Less, Retrieve More: Hybrid Tree Construction for Speculative Decoding

Paper • 2605.20104 • Published 17 days ago • 7

upvoted a paper 16 days ago

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

Paper • 2605.11609 • Published 24 days ago • 195

upvoted 2 papers 25 days ago

Mean Mode Screaming: Mean--Variance Split Residuals for 1000-Layer Diffusion Transformers

Paper • 2605.06169 • Published 29 days ago • 233

InterLV-Search: Benchmarking Interleaved Multimodal Agentic Search

Paper • 2605.07510 • Published 28 days ago • 6

upvoted 6 papers about 2 months ago

Paper Espresso: From Paper Overload to Research Insight

Paper • 2604.04562 • Published Apr 6 • 13

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 506

SkillClaw: Let Skills Evolve Collectively with Agentic Evolver

Paper • 2604.08377 • Published Apr 9 • 291

When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models

Paper • 2604.08546 • Published Apr 9 • 115

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 632

LatentUM: Unleashing the Potential of Interleaved Cross-Modal Reasoning via a Latent-Space Unified Model

Paper • 2604.02097 • Published Apr 2 • 32

upvoted a paper 2 months ago

Gen-Searcher: Reinforcing Agentic Search for Image Generation

Paper • 2603.28767 • Published Mar 30 • 58

upvoted 5 papers 3 months ago

SAMA: Factorized Semantic Anchoring and Motion Alignment for Instruction-Guided Video Editing

Paper • 2603.19228 • Published Mar 19 • 68

Efficient Reasoning with Balanced Thinking

Paper • 2603.12372 • Published Mar 12 • 151

HSImul3R: Physics-in-the-Loop Reconstruction of Simulation-Ready Human-Scene Interactions

Paper • 2603.15612 • Published Mar 16 • 153

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published Feb 23 • 525

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

Paper • 2602.10693 • Published Feb 11 • 221

upvoted 2 papers 4 months ago

SQuTR: A Robustness Benchmark for Spoken Query to Text Retrieval under Acoustic Noise

Paper • 2602.12783 • Published Feb 13 • 246

Less is Enough: Synthesizing Diverse Data in Feature Space of LLMs

Paper • 2602.10388 • Published Feb 11 • 245