dar-tau (Guy Dar)

upvoted an article 4 months ago

Friends and Grandmothers in Silico

Article • tux • Jan 24 • 10

upvoted an article 5 months ago

We Got Claude to Fine-Tune an Open Source LLM

Article • burtenshaw, evalstate • Dec 4, 2025 • 624

upvoted a paper 7 months ago

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6, 2025 • 514

upvoted an article 7 months ago

Train 400x faster Static Embedding Models with Sentence Transformers

Article • tomaarsen • Jan 15, 2025 • 230

upvoted a paper 7 months ago

LongCodeZip: Compress Long Context for Code Language Models

Paper • 2510.00446 • Published Oct 1, 2025 • 108

upvoted a paper 10 months ago

The Prompt Report: A Systematic Survey of Prompting Techniques

Paper • 2406.06608 • Published Jun 6, 2024 • 68

upvoted an article about 1 year ago

Our Transformers Code Agent beats the GAIA benchmark 🏅

Article • m-ric, sergeipetrov • Jul 1, 2024 • 100

upvoted a paper almost 2 years ago

Evaluating D-MERIT of Partial-annotation on Information Retrieval

Paper • 2406.16048 • Published Jun 23, 2024 • 36

upvoted an article about 2 years ago

Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent

Article • qgallouedec, edbeeching, ClementRomac, thomwolf +2 • Apr 22, 2024 • 81

upvoted a paper about 2 years ago

In-context Learning and Gradient Descent Revisited

Paper • 2311.07772 • Published Nov 13, 2023 • 2

upvoted 5 collections about 2 years ago

Large Language models and Knowledge graphs

Popular Spaces

🔍 Interpretability & Analysis of LMs

Model Merging Papers

💫 StarCoder2