AMIRAN KURTANIDZE

sunsulaki

AI & ML interests

None yet

Organizations

None yet

upvoted 10 papers 1 day ago

Nemotron 3 Ultra: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Paper • 2606.15007 • Published 7 days ago • 13

APPO: Agentic Procedural Policy Optimization

Paper • 2606.12384 • Published 8 days ago • 73

On the Geometry of On-Policy Distillation

Paper • 2606.07082 • Published 14 days ago • 72

MaxProof: Scaling Mathematical Proof with Generative-Verifier RL and Population-Level Test-Time Scaling

Paper • 2606.13473 • Published 8 days ago • 89

ResearchClawBench: A Benchmark for End-to-End Autonomous Scientific Research

Paper • 2606.07591 • Published 22 days ago • 90

Toward Generalist Autonomous Research via Hypothesis-Tree Refinement

Paper • 2606.11926 • Published 9 days ago • 111

SWE-Explore: Benchmarking How Coding Agents Explore Repositories

Paper • 2606.07297 • Published 14 days ago • 115

Imaginative Perception Tokens Enhance Spatial Reasoning in Multimodal Language Models

Paper • 2606.03988 • Published 16 days ago • 121

MiniMax Sparse Attention

Paper • 2606.13392 • Published 8 days ago • 138

VibeThinker-3B: Exploring the Frontier of Verifiable Reasoning in Small Language Models

Paper • 2606.16140 • Published 4 days ago • 91

upvoted a paper 9 days ago

GRAM-R^2: Self-Training Generative Foundation Reward Models for Reward Reasoning

Paper • 2509.02492 • Published Sep 2, 2025 • 2

upvoted a paper 26 days ago

Nudging Beyond the Comfort Zone: Efficient Strategy-Guided Exploration for RLVR

Paper • 2605.15726 • Published May 15 • 34

upvoted a paper 27 days ago

Stop When Reasoning Converges: Semantic-Preserving Early Exit for Reasoning Models

Paper • 2605.17672 • Published May 17 • 23

upvoted a paper about 1 month ago

Continuous Latent Diffusion Language Model

Paper • 2605.06548 • Published May 7 • 82

upvoted 2 papers 3 months ago

The Universal Normal Embedding

Paper • 2603.21786 • Published Mar 23 • 16

ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference

Paper • 2511.10645 • Published Nov 13, 2025 • 13

upvoted a collection 3 months ago

ParoQuant

Pairwise Rotation Quantization for Efficient Reasoning LLM Inference • 24 items • Updated 10 days ago • 26

upvoted 3 papers 4 months ago

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Paper • 2601.22975 • Published Jan 30 • 113

Chain of Mindset: Reasoning with Adaptive Cognitive Modes

Paper • 2602.10063 • Published Feb 10 • 75

GLM-5: from Vibe Coding to Agentic Engineering

Paper • 2602.15763 • Published Feb 17 • 166