AJ-Bench: Benchmarking Agent-as-a-Judge for Environment-Aware Evaluation Paper • 2604.18240 • Published 9 days ago • 15
Reviving DSP for Advanced Theorem Proving in the Era of Reasoning Models Paper • 2506.11487 • Published Jun 13, 2025 • 3
StepFun-Formalizer: Unlocking the Autoformalization Potential of LLMs through Knowledge-Reasoning Fusion Paper • 2508.04440 • Published Aug 6, 2025 • 9
StepFun-Formalizer: Unlocking the Autoformalization Potential of LLMs through Knowledge-Reasoning Fusion Paper • 2508.04440 • Published Aug 6, 2025 • 9
view article Article Kimina-Prover: Applying Test-time RL Search on Large Formal Reasoning Models Jul 10, 2025 • 55
Reviving DSP for Advanced Theorem Proving in the Era of Reasoning Models Paper • 2506.11487 • Published Jun 13, 2025 • 3