Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients Paper • 2606.18216 • Published 20 days ago • 63
The Reasoning-Creativity Trade-off: Toward Creativity-Driven Problem Solving Paper • 2601.00747 • Published Jan 2 • 20
L2MAC: Large Language Model Automatic Computer for Extensive Code Generation Paper • 2310.02003 • Published Oct 2, 2023