Denis Akhiyarov

dtanow

7 100 47

AI & ML interests

AI Code Generation with LLMs

Recent Activity

upvoted a paper 1 day ago

Agentic Abstention: Do Agents Know When to Stop Instead of Act?

Paper • 2606.28733 • Published 6 days ago • 138

upvoted a paper 20 days ago

MiniMax Sparse Attention

Paper • 2606.13392 • Published 22 days ago • 148

upvoted a paper 23 days ago

SWE-Explore: Benchmarking How Coding Agents Explore Repositories

Paper • 2606.07297 • Published 28 days ago • 122

upvoted an article 23 days ago

Article

Can Voice Agents Handle Bilingual Customers? Benchmarking Frontier ASR on Code-Switched Speech

ServiceNow-AI • 23 days ago • 44

liked a Space about 1 month ago

MTEB Leaderboard

📊

7.53k

Embedding Leaderboard

upvoted 3 papers about 1 month ago

Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models

Paper • 2510.04618 • Published Oct 6, 2025 • 134

Hyperagents

Paper • 2603.19461 • Published Mar 19 • 51

Code as Agent Harness

Paper • 2605.18747 • Published May 18 • 223

upvoted 2 papers about 2 months ago

EVA-Bench: A New End-to-end Framework for Evaluating Voice Agents

Paper • 2605.13841 • Published May 13 • 76

Do Enterprise Systems Need Learned World Models? The Importance of Context to Infer Dynamics

Paper • 2605.12178 • Published May 12 • 65

liked a model about 2 months ago

nvidia/llama-nemotron-embed-vl-1b-v2

upvoted an article 2 months ago

Article

Vision Language Models (Better, faster, stronger)

merve, sergiopaniego, ariG23498, pcuenq, andito • May 12, 2025 • 613

upvoted 3 papers 3 months ago

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published Apr 8 • 329

Apriel-Reasoner: RL Post-Training for General-Purpose and Efficient Reasoning

Paper • 2604.02007 • Published Apr 2 • 14

Therefore I am. I Think

Paper • 2604.01202 • Published Apr 2 • 33

submitted a paper to Daily Papers 3 months ago

Therefore I am. I Think

Paper • 2604.01202 • Published Apr 2 • 33

upvoted 2 papers 3 months ago

Terminal Agents Suffice for Enterprise Automation

Paper • 2604.00073 • Published Mar 31 • 98

CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents

Paper • 2603.24440 • Published Mar 25 • 99

liked a dataset 3 months ago

ServiceNow-AI/eva

Viewer • Updated Mar 24 • 50 • 78 • 71

upvoted an article 3 months ago

Article

A New Framework for Evaluating Voice Agents (EVA)

ServiceNow-AI • Mar 24 • 95
