🏗️ Building on HF

Dipankar Sarkar PRO

dipankarsarkar

https://www.dipankar.cc

AI & ML interests

Building the AI-native stack. Agents as infrastructure, safety as architecture, performance as plumbing. I publish the receipts: papers, datasets, demos.

Recent Activity

reacted to stas's post with 🤗 about 5 hours ago

I present to you a new experimental open book. https://github.com/stas00/python-cookbook I took my dense Python cheatsheet that I have been honing for many years and use a lot daily and turned it into a book of recipes. Is this useful? This is, of course, free, like other open books.

reacted to salma-remyx's post with 🔥 about 5 hours ago

What's holding your code back? Outrider finds, implements, and validates methods for your repo. While testing Outrider on a fork of huggingface/peft, I discovered "Riemannian Preconditioned LoRA for Fine-Tuning Foundation Models" (arxiv: 2402.02347) The work offers improved stability and faster convergence in LoRA finetuning by adjusting updates for curvature that LoRA optimizers typically ignore. Not the most recent paper, so I was pleasantly surprised my action surfaced this method as a candidate before implementing a PR. Even more surprised this method had not already been merged upstream. Turns out, the author did try contributing to peft a couple years ago, but people get busy and the PR was closed after going stale. So I decided to revive it! I opened an issue and soon after the author engaged to help land the feature. Now huggingface/peft #3382 is open, a joint effort with the paper's author. This whole episode has me thinking about the future of OSS maintenance with AI coding. The software projects which endure will be well-shaped to quickly land and help test new ideas. Across 30 forks, I've seen several papers land as clean PRs for multiple repos, which offers a perspective on how methods impact applications. Recent methods matching multiple frameworks: STARE, Entity Binding, BINEVAL Get Outrider: https://github.com/remyxai/outrider

upvoted a paper about 10 hours ago

AI translation of literary texts is "fine", but readers still prefer human translations

View all activity

Organizations

upvoted a paper about 10 hours ago

AI translation of literary texts is "fine", but readers still prefer human translations

Paper • 2606.26040 • Published 9 days ago • 3

upvoted a paper about 12 hours ago

Are Performance-Optimization Benchmarks Reliably Measuring Coding Agents?

Paper • 2607.01211 • Published 2 days ago • 4

upvoted 2 papers about 13 hours ago

GenericAgent: A Token-Efficient Self-Evolving LLM Agent via Contextual Information Density Maximization (V1.0)

Paper • 2604.17091 • Published Apr 18 • 23

Agent READMEs: An Empirical Study of Context Files for Agentic Coding

Paper • 2511.12884 • Published Nov 17, 2025 • 29

upvoted 2 papers about 14 hours ago

AtomiMed: Hierarchical Atomic Fact-Checking for Universal Clinical-Aware Medical Report Evaluation

Paper • 2606.31292 • Published 3 days ago • 4

Cross-Domain Generalization Failure in Lightweight Intrusion Detection Models for IIoT Networks

Paper • 2607.00553 • Published 2 days ago • 5

upvoted 2 papers about 15 hours ago

The State-Prediction Separation Hypothesis

Paper • 2607.01218 • Published 2 days ago • 7

CausalMix: Data Mixture as Causal Inference for Language Model Training

Paper • 2607.01104 • Published 2 days ago • 14

upvoted 4 papers about 16 hours ago

PerceptionRubrics: Calibrating Multimodal Evaluation to Human Perception

Paper • 2606.28322 • Published 7 days ago • 35

RepoRescue: An Empirical Study of LLM Agents on Whole-Repository Compatibility Rescue

Paper • 2607.01213 • Published 2 days ago • 2

TRIAGE: Role-Typed Credit Assignment for Agentic Reinforcement Learning

Paper • 2606.32017 • Published 3 days ago • 7

SWE-INTERACT: Reimagining SWE Benchmarks as User-Driven Long-Horizon Coding Sessions

Paper • 2606.30573 • Published 4 days ago • 4

upvoted 2 papers about 17 hours ago

Graph-Native Reinforcement Learning Enables Traceable Scientific Hypothesis Generation through Conceptual Recombination

Paper • 2607.00924 • Published 2 days ago • 4

AutoTrainess: Teaching Language Models to Improve Language Models Autonomously

Paper • 2606.31551 • Published 3 days ago • 8

upvoted 4 papers about 20 hours ago

When LLMs Read Tables Carelessly: Measuring and Reducing Data Referencing Errors

Paper • 2606.32029 • Published 3 days ago • 4

upvoted 2 papers 1 day ago

Hierarchical Experimentalist Agents

Paper • 2606.29315 • Published 5 days ago • 2

Are We Measuring Strategy or Phrasing? The Gap Between Surface- and Approach-Level Diversity in LLM Math Reasoning

Paper • 2606.29985 • Published 4 days ago • 16

Dipankar Sarkar PRO

AI & ML interests

Recent Activity

Organizations

dipankarsarkar's activity