A Local Perturbation Theory for Cross-Domain Interference and Recovery in Multi-Domain RL Paper • 2606.02398 • Published 4 days ago • 26
Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models Paper • 2601.14004 • Published Jan 20 • 48