Shallow Prefill, Deep Decoding: Efficient Long-Context Inference via Layer-Asymmetric KV Visibility Paper • 2605.06105 • Published 5 days ago
Latent Self-Consistency for Reliable Majority-Set Selection in Short- and Long-Answer Reasoning Paper • 2508.18395 • Published Aug 25, 2025