UniSD: Towards a Unified Self-Distillation Framework for Large Language Models • Paper • 2605.06597 • Published 11 days ago • 15
One Turn Too Late: Response-Aware Defense Against Hidden Malicious Intent in Multi-Turn Dialogue • Paper • 2605.05630 • Published 6 days ago • 11