Distilling the Essence: Efficient Reasoning Distillation via Sequence Truncation Paper • 2512.21002 • Published Dec 24, 2025
Scaling Down, Serving Fast: Compressing and Deploying Efficient LLMs for Recommendation Systems Paper • 2502.14305 • Published Oct 26, 2025
PluRel: Synthetic Data unlocks Scaling Laws for Relational Foundation Models Paper • 2602.04029 • Published Feb 3, 2026
CoT-ICL Lab: A Synthetic Framework for Studying Chain-of-Thought Learning from In-Context Demonstrations Paper • 2502.15132 • Published Feb 21, 2025
To Think or Not to Think: The Hidden Cost of Meta-Training with Excessive CoT Examples Paper • 2512.05318 • Published Dec 4, 2025 • 3
Liger Kernel: Efficient Triton Kernels for LLM Training Paper • 2410.10989 • Published Oct 14, 2024 • 3
Randomized Schur Complement Views for Graph Contrastive Learning Paper • 2306.04004 • Published Jun 6, 2023