Article We Got Claude to Fine-Tune an Open Source LLM burtenshaw, evalstate • Dec 4, 2025 • 624
Less is More: Recursive Reasoning with Tiny Networks Paper • 2510.04871 • Published Oct 6, 2025 • 514
Article Train 400x faster Static Embedding Models with Sentence Transformers tomaarsen • Jan 15, 2025 • 230
LongCodeZip: Compress Long Context for Code Language Models Paper • 2510.00446 • Published Oct 1, 2025 • 108
The Prompt Report: A Systematic Survey of Prompting Techniques Paper • 2406.06608 • Published Jun 6, 2024 • 68
Article Our Transformers Code Agent beats the GAIA benchmark 🏅 m-ric, sergeipetrov • Jul 1, 2024 • 100
Evaluating D-MERIT of Partial-annotation on Information Retrieval Paper • 2406.16048 • Published Jun 23, 2024 • 36
Article Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent +2 qgallouedec, edbeeching, ClementRomac, thomwolf • Apr 22, 2024 • 81
🔍 Interpretability & Analysis of LMs Collection Outstanding research in LM interpretability and evaluation, summarized • 135 items • Updated Dec 18, 2025 • 119
Model Merging Papers Collection Collection of relevant papers about model merging • 13 items • Updated Apr 2, 2024 • 6