view article Article Introducing Mellum2: A 12B Mixture-of-Experts Model by JetBrains JetBrains • about 18 hours ago • 14
JetBrains/Mellum2-12B-A2.5B-Base-Pretrain Text Generation • 12B • Updated about 22 hours ago • 22 • 5
JetBrains/Mellum2-12B-A2.5B-Base Text Generation • 12B • Updated about 22 hours ago • 188 • 8
JetBrains/Mellum2-12B-A2.5B-Instruct-SFT Text Generation • 12B • Updated about 22 hours ago • 35 • 5
JetBrains/Mellum2-12B-A2.5B-Thinking-SFT Text Generation • 12B • Updated about 22 hours ago • 34 • 12
JetBrains/Mellum2-12B-A2.5B-Instruct Text Generation • 12B • Updated about 22 hours ago • 128 • 23
JetBrains/Mellum2-12B-A2.5B-Thinking Text Generation • 12B • Updated about 18 hours ago • 799 • 76
JetBrains/Mellum2-12B-A2.5B-Base-Pretrain Text Generation • 12B • Updated about 22 hours ago • 22 • 5
On Problems of Implicit Context Compression for Software Engineering Agents Paper • 2605.11051 • Published 22 days ago
The Best of N Worlds: Aligning Reinforcement Learning with Best-of-N Sampling via max@k Optimisation Paper • 2510.23393 • Published Oct 27, 2025 • 21
The Best of N Worlds: Aligning Reinforcement Learning with Best-of-N Sampling via max@k Optimisation Paper • 2510.23393 • Published Oct 27, 2025 • 21
The Complexity Trap: Simple Observation Masking Is as Efficient as LLM Summarization for Agent Context Management Paper • 2508.21433 • Published Aug 29, 2025 • 8