ThinkTwice: Jointly Optimizing Large Language Models for Reasoning and Self-Refinement Paper • 2604.01591 • Published 29 days ago • 42
rubricreward/mR3-Qwen3-14B-tgt-prompt-tgt-thinking-translated Text Generation • 15B • Updated Oct 2, 2025 • 10
rubricreward/mR3-Qwen3-14B-tgt-prompt-tgt-thinking-translated Text Generation • 15B • Updated Oct 2, 2025 • 10