Ankit

Ajax0564

AI & ML interests

NLP

Recent Activity

liked a model 2 months ago

tencent/Penguin-VL-2B

Organizations

None yet

upvoted a paper 3 days ago

Anisotropic Modality Align

Paper • 2605.07825 • Published 6 days ago • 26

upvoted an article 20 days ago

Article

🚀 DTS: A Candidate for the Best Parallel Reasoning in LLMs

guan-wang • Feb 11 • 14

upvoted an article 2 months ago

Article

NEO-unify: Building Native Multimodal Unified Models End to End

sensenova • Mar 5 • 159

upvoted an article 4 months ago

Article

LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family

lightonai • Jan 19 • 93

upvoted a paper 4 months ago

Router-Suggest: Dynamic Routing for Multimodal Auto-Completion in Visually-Grounded Dialogs

Paper • 2601.05851 • Published Jan 9 • 3

upvoted a paper 5 months ago

Bolmo: Byteifying the Next Generation of Language Models

Paper • 2512.15586 • Published Dec 17, 2025 • 17

upvoted an article 5 months ago

Article

Why You Should Care About Partial Differential Equations (PDEs)

hugging-science • Dec 12, 2025 • 45

upvoted an article 6 months ago

Article

Continuous batching from first principles

ror, ArthurZ, mcpotato • Nov 25, 2025 • 380

upvoted a paper 6 months ago

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

Paper • 2511.06221 • Published Nov 9, 2025 • 134

upvoted an article 7 months ago

Article

LightOnOCR-1B: The Case for End-to-End and Efficient Domain-Specific Vision-Language Models for OCR

lightonai • Oct 23, 2025 • 73

upvoted 2 papers 8 months ago

AToken: A Unified Tokenizer for Vision

Paper • 2509.14476 • Published Sep 17, 2025 • 37

SAIL-VL2 Technical Report

Paper • 2509.14033 • Published Sep 17, 2025 • 44

upvoted an article 9 months ago

Article

Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training

smohammadi, siro1, winglian, marcsun13, djsaunde • Aug 8, 2025 • 98

upvoted 2 papers 10 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24, 2025 • 320

∇NABLA: Neighborhood Adaptive Block-Level Attention

Paper • 2507.13546 • Published Jul 17, 2025 • 126

upvoted an article 10 months ago

Article

Understanding Gemma 3n: How MatFormer Gives You Many Models in One

rishiraj • Jun 26, 2025 • 50

upvoted a paper 10 months ago

Kwai Keye-VL Technical Report

Paper • 2507.01949 • Published Jul 2, 2025 • 132

upvoted 2 papers 11 months ago

Ovis-U1 Technical Report

Paper • 2506.23044 • Published Jun 29, 2025 • 62

Vision-Guided Chunking Is All You Need: Enhancing RAG with Multimodal Document Understanding

Paper • 2506.16035 • Published Jun 19, 2025 • 89

upvoted an article 11 months ago

Article

Learn the Hugging Face Kernel Hub in 5 Minutes

drbh, danieldk, Narsil, pcuenq, pagezyhf, merve, reach-vb • Jun 12, 2025 • 164
