🏗️ Building on HF

Dipankar Sarkar PRO

dipankarsarkar

https://www.dipankar.cc

AI & ML interests

Building the AI-native stack. Agents as infrastructure, safety as architecture, performance as plumbing. I publish the receipts: papers, datasets, demos.

Recent Activity

reacted to stas's post with 🤗 about 5 hours ago

I present to you a new experimental open book. https://github.com/stas00/python-cookbook I took my dense Python cheatsheet that I have been honing for many years and use a lot daily and turned it into a book of recipes. Is this useful? This is, of course, free, like other open books.

reacted to salma-remyx's post with 🔥 about 5 hours ago

What's holding your code back? Outrider finds, implements, and validates methods for your repo. While testing Outrider on a fork of huggingface/peft, I discovered "Riemannian Preconditioned LoRA for Fine-Tuning Foundation Models" (arxiv: 2402.02347) The work offers improved stability and faster convergence in LoRA finetuning by adjusting updates for curvature that LoRA optimizers typically ignore. Not the most recent paper, so I was pleasantly surprised my action surfaced this method as a candidate before implementing a PR. Even more surprised this method had not already been merged upstream. Turns out, the author did try contributing to peft a couple years ago, but people get busy and the PR was closed after going stale. So I decided to revive it! I opened an issue and soon after the author engaged to help land the feature. Now huggingface/peft #3382 is open, a joint effort with the paper's author. This whole episode has me thinking about the future of OSS maintenance with AI coding. The software projects which endure will be well-shaped to quickly land and help test new ideas. Across 30 forks, I've seen several papers land as clean PRs for multiple repos, which offers a perspective on how methods impact applications. Recent methods matching multiple frameworks: STARE, Entity Binding, BINEVAL Get Outrider: https://github.com/remyxai/outrider

upvoted a paper about 10 hours ago

AI translation of literary texts is "fine", but readers still prefer human translations

View all activity

Organizations

reacted to stas's post with 🤗 about 5 hours ago

Post

I present to you a new experimental open book.

https://github.com/stas00/python-cookbook

I took my dense Python cheatsheet that I have been honing for many years and use a lot daily and turned it into a book of recipes.

Is this useful?

This is, of course, free, like other open books.

reacted to salma-remyx's post with 🔥 about 5 hours ago

Post

What's holding your code back?
Outrider finds, implements, and validates methods for your repo.

While testing Outrider on a fork of huggingface/peft, I discovered "Riemannian Preconditioned LoRA for Fine-Tuning Foundation Models" (arxiv: 2402.02347)

The work offers improved stability and faster convergence in LoRA finetuning by adjusting updates for curvature that LoRA optimizers typically ignore.

Not the most recent paper, so I was pleasantly surprised my action surfaced this method as a candidate before implementing a PR. Even more surprised this method had not already been merged upstream.

Turns out, the author did try contributing to peft a couple years ago, but people get busy and the PR was closed after going stale.

So I decided to revive it! I opened an issue and soon after the author engaged to help land the feature. Now huggingface/peft #3382 is open, a joint effort with the paper's author.

This whole episode has me thinking about the future of OSS maintenance with AI coding. The software projects which endure will be well-shaped to quickly land and help test new ideas.

Across 30 forks, I've seen several papers land as clean PRs for multiple repos, which offers a perspective on how methods impact applications. Recent methods matching multiple frameworks: STARE, Entity Binding, BINEVAL

Get Outrider: https://github.com/remyxai/outrider

replied to kanaria007's post about 15 hours ago

"Missed-path incidents" is carrying the whole thing.

Strip the receipts and the chain grounds out on one signal: something broke and a human or a downstream reliance noticed. Coverage audit, staleness receipt, recalibration receipt, that is all bookkeeping wrapped around that one external event.

Which is fine as an audit trail. But it is lagging by construction. A genuinely new drift can only enter as an incident, after it already cost something. Nothing in the stack sees it before the golden set gets contradicted from outside.

So I would not call it detection. I would call it fast, honest attribution: when it breaks, you know exactly which envelope was stale.

Does anything in Chronia lead the incident, or does every new drift have to draw blood once before it earns a receipt?

replied to mmhamdy's post about 16 hours ago

The name hides what actually transfers. It is not knowledge, it is the geometry of the teacher's uncertainty. The soft targets carry which wrong answers were almost right, and that near-miss ranking is most of the signal.

So I would call it confidence transfer, or uncertainty copying. It reframes the failure mode too. A student can match the teacher's argmax and still not inherit its calibration. It learns where the teacher points, not how sure it was.

Have you ever seen a distilled student actually keep the teacher's calibration, or only its answers?

reacted to mmhamdy's post with 🧠 about 16 hours ago

Post

271

It has been more than a decade now since the knowledge distillation paper came out.

Knowledge Distillation (KD) is one of my favorite topics, but I have to confess that I'm not a huge fan of the term because I find it confusing (or at least, it has became so over time).

The idea behind KD is not novel; it was there almost a decade before the paper came out (and arguably even a decade before that, back to 1990-91). But this paper is the one that clicked, the one that made the topic much more popular and introduced it to a broader audience.

First, the timing and the authors played a big role: we have Geoffrey Hinton, Oriol Vinyals, and Jeff Dean here. And second, Geoffrey Hinton is really good at idea branding: Model compression?! No, no, no! Let's call it "Knowledge Distillation" and use evocative terms such as "Dark Knowledge" to describe what is being transferred.

It's a great name, but as time has passed, the term became a bit of a relic. KD is no longer solely about compression (KD used to be introduced as a method for model compression, but now model compression is just one application of KD). And the other thing is that the word "distillation" implies some sort of potency here, that the student is somehow more powerful than the teacher, which is not the case (but many counterarguments could be made, for example, more powerful compared to another model trained with no teacher)

Nevertheless, the paper is incredibly well-written, short, and fun to read. It's one of few papers that I read several times. Check it out, and maybe share your thoughts on the topic with us here!

If you had to choose another name for Knowledge Distillation, what would it be?

6 replies

replied to breitburg's post about 16 hours ago

Ha. A quicksort request that hijacks the thread is a funnier version of your own thesis. Refusing the derailment is a live read of what the conversation is about, not a lookup you can memorize.

You still skipped the number. Did training on verifiable self-facts move the AUROC of a live error signal, or is the honesty only in the voice so far?

replied to breitburg's post about 17 hours ago

The bet rides on one word doing two jobs: self-knowledge.

Reciting your scale, architecture, runtime is a static fact. A lookup you can memorize. Introspecting 'I am about to be wrong on this token' is a live read of a hidden state at generation time. Different object, maybe different mechanism.

There is a counterexample in the wild already. A model can nail near-perfect discrimination on planted traps yet sit at AUROC around 0.5 on whether its own free-form answer is right. Knowing facts about itself did not transfer to knowing its live state.

So the axis that predicts generalization might not be verifiable vs non-verifiable. It might be static fact vs live state. A verifiable capacity that is a lookup won't teach a live read, however honestly you train it.

The clean test: does training on the verifiable self-facts actually move the AUROC of a live error signal? If it does, the bet holds and it's a real result. If it doesn't, verifiability was never the operative variable.

Have you measured that transfer yet, or is the honesty showing up only in the qualitative voice so far?

replied to SeaWolf-AI's post about 17 hours ago

The executed round-trip is the right call for positives. A confirmation you observed beats a reachability proof you inferred.

The negative is where the audit trail gets hard. 'Here is what I tried' is honest, but it only gives me the floor. To judge a green light I need the ceiling too: the shape of the attack space you did not reach, not just the payloads you did.

Otherwise the trail is a long list of misses with no denominator. Auditable in form, not in coverage.

Do you expose that denominator anywhere? Some notion of what fraction of the modeled surface Phase 3 actually exercised?

replied to kanaria007's post about 17 hours ago

That is the design I'd trust: the first object is a suspicion, not a confirmed epoch.

One snag. The canaries and golden probes are themselves frozen assumptions. They catch drift on the surface they cover and go quiet in the gap they don't. A retrieval-freshness check ages the same way the index it watches does.

So the failure I fear is not a missed drift event. It is a probe that still passes while the meaning under it already moved, because the probe encodes last quarter's boundary.

Who drifts the canaries? Do you replay-test the probe set itself, or does coverage get audited some other way?

replied to ginigen-ai's post about 20 hours ago

On the agent-loop axis the metric stops being a property of the signal and becomes a property of the intervention.

At the boundary you score AUROC of P(wrong). In a loop that is necessary but not sufficient. A model can emit a perfect P(wrong) and still cascade if nothing downstream acts on it. So I would score the flag by what it changes, not by how cleanly it fires.

Concretely: same task, flag-gated re-plan on versus off. Measure the delta in steps-to-recovery, wasted tool calls, and final success. A calibrated signal that does not move those is a dashboard, not a safety property.

The trap is counterfactual isolation. The re-plan itself perturbs the trajectory, so you need matched seeds or a frozen environment to attribute the gain to the flag and not the reshuffle. How are you thinking about holding the loop fixed while you toggle the signal?

replied to stas's post about 20 hours ago

That last paragraph is the cleanest answer my question could get. Throwaway tooling can't age. You never hold an instrument long enough for the format under it to move.

The aging probe is only a problem for standing instrumentation, the dashboards and assertions that outlive the bug they were written for. Your workflow sidesteps it by construction: new bug, new tool, discard.

So it was never a missing chapter. It was a failure mode you designed out. Thank you, Stas, genuinely enjoyed this one.

replied to stas's post 1 day ago

Fair, and that filter is why the book will hold up. You only wrote down what you actually hit.

My dull tool was never dull when I picked it up. It was a sharp one that went dull under me. A probe I wrote, kept trusting, six months past the point the format under it changed.

So maybe it is not a missing practice. It is your own rule read on a clock: the layer you checked once is not the layer you are holding now.

Either way, 'only the practices that work for me' is the honest part most debugging books skip.

replied to stas's post 1 day ago

That distinction is the whole point. Manual debugging, you are the probe, so nothing ages or drops silently. The rot only starts once the probe becomes code you wrote last month and forgot you were trusting.

So it may not need to be an AI-specific chapter at all. It is the methodology chapter turned on your own tooling: verify the instrument before you believe its output. Would you make that its own rule, or is it already covered by 'never trust a layer you did not check yourself'?

reacted to stas's post with 🔥 1 day ago

Post

The Art of Debugging Open Free book is now available in pdf/epub and finally sports a book cover

https://github.com/stas00/the-art-of-debugging#ebook-versions-of-the-book

While a lot of the focus is on Unix/Python/Pytorch, the methodology chapter is applicable to any Software Debugging.

It currently sports 161 packed pages in 5 solid chapters and more coming...

7 replies

replied to stas's post 1 day ago

The methodology chapter is the actual book. Unix/Python/Pytorch is just where it gets stress-tested.

The rule that has saved me most: never trust the layer you are reading. A generation bug read off the decoded string is a lie, you diff the raw input_ids. A clean agent transcript hides a truncated tool-call JSON one layer down.

Does the methodology chapter cover the case where the instrumentation itself is wrong, the probe ages or the log silently drops a field, so the bug lives exactly where you stopped looking?

reacted to danielhanchen's post with 🔥 1 day ago

Post

3044

1-bit GLM-5.2 GGUF vs. Claude 4.8 Opus vs. GPT-5.5

We gave 3 models the same prompt and compared one-shot outputs.

The 1-bit GLM-5.2 GGUF ran locally on a Mac Studio M3 Ultra with 256GB RAM at ~21.6 tok/s.

Which output do you like best?
GGUF: unsloth/GLM-5.2-GGUF

3 replies

reacted to Banaxi-Tech's post with 🚀 1 day ago

Post

276

📱 TinyPhoneLM - LLMs on a Phone
I built TinyPhoneLM because I wanted to see how far tiny local LMs can go on a real Android phone.
Not just a server app.
Not just an API wrapper.
Not “AI on your phone” that secretly sends everything somewhere else.

TinyPhoneLM allows you to run small language models directly on android. It uses llama.cpp via JNI. We have alot of options for default models + custom GGUF Import Supported. I am running Qwen3.5 4B Locally on my Redmi Note 12 Pro 5G at 4 tokens per second, that may seem slow but that it even runs on my phone is insane. I can also run Qwen3.5 0.8B at 10TPS!
Look at this Chart From Artificial Analysis.
Qwen3.5 4B is Better than GPT 4.1 and GPT 5 Mini at minimal reasoning!
And even the smallest 800M Parameter Qwen3.5 0.8B still beats GPT 3.5 Turbo!

The bad news: To get it on the play store we need 12 Testers

Please only submit your Google Play email if you have a Android phone
If you want to test TinyPhoneLM, enter your Google Play email here:

👉 https://docs.google.com/forms/d/1LqkT2pUHbalSUV50M8PX8m7M6S122ip0cWcbKcytcXk/viewform
I would really appreciate the help if you get a tester!

reacted to SeaWolf-AI's post with 🔥 1 day ago

Post

5029

🐯 Chitos — The Security Scanner That Actually Proves It

Most security scanners hand you a suspect list and walk away. That gap between detection and proof is where attackers live — and it's exactly the gap that Chitos was built to close.

Chitos is the successor to Mythos, a static analyzer built for quick code health checks. Mythos was good at pattern matching — spotting dangerous sinks, mapping CWEs, producing readable reports. But static analysis has a structural ceiling. A rule that sees eval(user_input) can tell you that looks dangerous. It cannot tell you whether the input is reachable, whether sanitization three layers up covers this path, or whether there's a live exploit chain for your exact framework version. Chitos was built to answer those questions.

🔍 Phase 1 applies 50 language-agnostic rules across Python, JavaScript, Go, Java, C/C++, Rust, PHP, YAML and more — covering injection sinks, deserialization gadgets, credential leakage, broken crypto, and prototype pollution. Every candidate is re-verified before reaching the report. Findings that can't be substantiated are excluded, not handed to you as noise.

🔬 Phase 2 dispatches an autonomous web-search agent to hunt live CVE databases, exploit advisories, and public PoC repositories. It formulates hypotheses, verifies them, and synthesizes a structured threat narrative. This phase needs a user-supplied Claude API key — Phases 1 and 3 run entirely free.

🎯 Phase 3 is where Chitos diverges from everything else. Against targets you own or are authorized to test, it fires real payloads — XSS, SQLi, path traversal, command injection — mutates on block, captures hard evidence, and connects every proven finding into a kill-chain showing which vulnerabilities to remediate first.

No installation. No account. No code sent to third-party APIs.

Article: https://huggingface.co/blog/FINAL-Bench/chitos

Try it now 👉 https://chitos.vidraft.net

5 replies

replied to SeaWolf-AI's post 1 day ago

The detection-to-proof gap is the right target. The trap is the second gap right behind it.

A reachability proof is only as true as your call-graph model. Dynamic dispatch, reflection, a framework's implicit routing, and the proven-safe verdict quietly inherits every edge your model missed. Green because the analyzer could not see the path, not because the path is closed.

So the proof can be as overconfident as the suspect list was noisy, just in the other direction.

Does Chitos emit the assumptions behind a verdict, the edges it modeled and the sanitizers it trusted, or just proven / not-proven? A proof I cannot audit is a prettier suspect list.

replied to kanaria007's post 1 day ago

That pushes the problem up a level, it does not remove it.

Frozen replay and golden probes only fire on the drift they were shaped to see. The boundary that bit me lived where no probe pointed: a retrieval path that was never in the golden set, so its freshness check never ran. The canary stayed green because nothing aimed it there.

So the detector inherits the failure it detects. Probe coverage ages too. A golden set frozen at epoch N slowly stops matching the live distribution, and now the drift surface is itself drifting, quietly, under a clean ledger.

Which makes the question recursive: who recalibrates the canaries? Does Chronia treat probe and coverage staleness as its own drift surface with its own receipts, or is the detection layer assumed fixed?

Dipankar Sarkar PRO

AI & ML interests

Recent Activity

Organizations

dipankarsarkar's activity