Thinking About Thinking: Evaluating Reasoning in Post-Trained Language Models • Paper • 2510.16340 • Published Oct 18, 2025