32 23

AMIRAN KURTANIDZE

sunsulaki

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Nemotron 3 Ultra: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

upvoted a paper 2 days ago

APPO: Agentic Procedural Policy Optimization

upvoted a paper 2 days ago

On the Geometry of On-Policy Distillation

View all activity

Organizations

None yet

upvoted a paper 1 day ago

Nemotron 3 Ultra: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Paper • 2606.15007 • Published 7 days ago • 13

upvoted 9 papers 2 days ago

APPO: Agentic Procedural Policy Optimization

Paper • 2606.12384 • Published 8 days ago • 73

On the Geometry of On-Policy Distillation

Paper • 2606.07082 • Published 14 days ago • 72

MaxProof: Scaling Mathematical Proof with Generative-Verifier RL and Population-Level Test-Time Scaling

Paper • 2606.13473 • Published 8 days ago • 89

Imaginative Perception Tokens Enhance Spatial Reasoning in Multimodal Language Models

Paper • 2606.03988 • Published 16 days ago • 121

MiniMax Sparse Attention

Paper • 2606.13392 • Published 8 days ago • 138

VibeThinker-3B: Exploring the Frontier of Verifiable Reasoning in Small Language Models

Paper • 2606.16140 • Published 4 days ago • 94

liked 2 models 6 days ago

chopratejas/kompress-v2-base

Token Classification • 0.2B • Updated 9 days ago • 657 • 10

zjukg/OntoTune-sft-Llama3-8B

8B • Updated Jun 7, 2025 • 4 • 4

upvoted a paper 9 days ago

GRAM-R^2: Self-Training Generative Foundation Reward Models for Reward Reasoning

Paper • 2509.02492 • Published Sep 2, 2025 • 2

liked 3 models 9 days ago

Ray2333/GRM-Llama3.2-3B-rewardmodel-ft

Text Classification • 3B • Updated Apr 30, 2025 • 2.16k • 14

nicholasKluge/RewardModel

Text Classification • 0.1B • Updated Jun 9, 2025 • 28 • 2

vectara/hallucination_evaluation_model

Text Classification • 0.1B • Updated Oct 20, 2025 • 83.7k • 355

liked a model 12 days ago

teapotai/tinyteapot

Text Generation • 77M • Updated Feb 23 • 286 • 23

liked a model 19 days ago

nvidia/LocateAnything-3B

Image-Text-to-Text • 4B • Updated 6 days ago • 183k • 2.16k

liked a dataset 19 days ago

openbmb/UltraData-SFT-2605

Updated 21 days ago • 45.8k • 347

upvoted a paper 26 days ago

Nudging Beyond the Comfort Zone: Efficient Strategy-Guided Exploration for RLVR

Paper • 2605.15726 • Published May 15 • 34

AMIRAN KURTANIDZE

AI & ML interests

Recent Activity

Organizations

sunsulaki's activity