Sayuri Kurokawa

why-42

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

One Forward Beats Two: InnerZoom for Accurate and Efficient GUI Grounding

liked a Space 6 days ago

AimeeBingmouQu/ProtectBirds

liked a dataset 15 days ago

datvo37092/datvo37092

View all activity

Organizations

None yet

upvoted a paper 2 days ago

One Forward Beats Two: InnerZoom for Accurate and Efficient GUI Grounding

Paper • 2606.30084 • Published 7 days ago • 6

upvoted 3 papers about 1 month ago

Macaron-A2UI: A Model for Generative UI in Personal Agents

Paper • 2605.24830 • Published May 24 • 84

SOD: Step-wise On-policy Distillation for Small Language Model Agents

Paper • 2605.07725 • Published May 8 • 25

DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards

Paper • 2605.21467 • Published May 20 • 207

upvoted a paper about 2 months ago

CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence

Paper • 2605.12882 • Published May 13 • 274

upvoted 6 papers 3 months ago

WildDet3D: Scaling Promptable 3D Detection in the Wild

Paper • 2604.08626 • Published Apr 9 • 248

Squeez: Task-Conditioned Tool-Output Pruning for Coding Agents

Paper • 2604.04979 • Published Apr 4 • 11

When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models

Paper • 2604.08546 • Published Apr 9 • 116

CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence

Paper • 2603.28032 • Published Mar 30 • 344

PixelSmile: Toward Fine-Grained Facial Expression Editing

Paper • 2603.25728 • Published Mar 26 • 118

MinerU-Diffusion: Rethinking Document OCR as Inverse Rendering via Diffusion Decoding

Paper • 2603.22458 • Published Mar 23 • 138

upvoted 7 papers 4 months ago

Generation Models Know Space: Unleashing Implicit 3D Priors for Scene Understanding

Paper • 2603.19235 • Published Mar 19 • 95

HSImul3R: Physics-in-the-Loop Reconstruction of Simulation-Ready Human-Scene Interactions

Paper • 2603.15612 • Published Mar 16 • 153

Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

Paper • 2603.04597 • Published Mar 4 • 211

DARE: Aligning LLM Agents with the R Statistical Ecosystem via Distribution-Aware Retrieval

Paper • 2603.04743 • Published Mar 5 • 53

From Blind Spots to Gains: Diagnostic-Driven Iterative Training for Large Multimodal Models

Paper • 2602.22859 • Published Feb 26 • 150

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published Feb 23 • 526

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

Paper • 2602.10693 • Published Feb 11 • 221

upvoted 2 papers 5 months ago

SQuTR: A Robustness Benchmark for Spoken Query to Text Retrieval under Acoustic Noise

Paper • 2602.12783 • Published Feb 13 • 246

Less is Enough: Synthesizing Diverse Data in Feature Space of LLMs

Paper • 2602.10388 • Published Feb 11 • 246