Illusions of Confidence? Diagnosing LLM Truthfulness via Neighborhood Consistency Paper • 2601.05905 • Published Jan 9 • 21
SelfReflect: Can LLMs Communicate Their Internal Answer Distribution? Paper • 2505.20295 • Published May 26, 2025 • 1
Outcome Accuracy is Not Enough: Aligning the Reasoning Process of Reward Models Paper • 2602.04649 • Published Feb 4 • 13
Soft Self-Consistency Improves Language Model Agents Paper • 2402.13212 • Published Feb 20, 2024 • 2
Nuance Matters: Probing Epistemic Consistency in Causal Reasoning Paper • 2409.00103 • Published Aug 27, 2024 • 1
Beyond External Monitors: Enhancing Transparency of Large Language Models for Easier Monitoring Paper • 2502.05242 • Published Feb 7, 2025 • 1
Thinking with Many Minds: Using Large Language Models for Multi-Perspective Problem-Solving Paper • 2501.02348 • Published Jan 4, 2025 • 1
Reasoning with Confidence: Efficient Verification of LLM Reasoning Steps via Uncertainty Heads Paper • 2511.06209 • Published Nov 9, 2025 • 20
Self-Contrast: Better Reflection Through Inconsistent Solving Perspectives Paper • 2401.02009 • Published Jan 4, 2024 • 2
The Trickle-down Impact of Reward (In-)consistency on RLHF Paper • 2309.16155 • Published Sep 28, 2023 • 2
MyGO Multiplex CoT: A Method for Self-Reflection in Large Language Models via Double Chain of Thought Thinking Paper • 2501.13117 • Published Jan 20, 2025 • 1
Are We on the Right Way to Assessing LLM-as-a-Judge? Paper • 2512.16041 • Published Dec 17, 2025 • 35
Toward Adaptive Reasoning in Large Language Models with Thought Rollback Paper • 2412.19707 • Published Dec 27, 2024 • 1
Improved Techniques for Training Consistency Models Paper • 2310.14189 • Published Oct 22, 2023 • 1
Are LLMs classical or nonmonotonic reasoners? Lessons from generics Paper • 2406.06590 • Published Jun 5, 2024 • 1
Stable Consistency Tuning: Understanding and Improving Consistency Models Paper • 2410.18958 • Published Oct 24, 2024 • 11
Metacognitive Prompting Improves Understanding in Large Language Models Paper • 2308.05342 • Published Aug 10, 2023 • 3