ZhengQi Wan's picture

ZhengQi Wan

Vanqi

·

42111058

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

WildDet3D: Scaling Promptable 3D Detection in the Wild

updated a collection 4 days ago

Interesting work but not directly related

upvoted a paper 5 days ago

CoInteract: Physically-Consistent Human-Object Interaction Video Synthesis via Spatially-Structured Co-Generation

View all activity

Organizations

None yet

upvoted a paper 4 days ago

WildDet3D: Scaling Promptable 3D Detection in the Wild

Paper • 2604.08626 • Published 18 days ago • 241

upvoted a paper 5 days ago

CoInteract: Physically-Consistent Human-Object Interaction Video Synthesis via Spatially-Structured Co-Generation

Paper • 2604.19636 • Published 6 days ago • 85

upvoted a paper 6 days ago

OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation

Paper • 2604.18486 • Published 7 days ago • 86

upvoted 2 papers 12 days ago

Introspective Diffusion Language Models

Paper • 2604.11035 • Published 14 days ago • 22

The Past Is Not Past: Memory-Enhanced Dynamic Reward Shaping

Paper • 2604.11297 • Published 14 days ago • 138

upvoted 2 papers 14 days ago

Training a Student Expert via Semi-Supervised Foundation Model Distillation

Paper • 2604.03841 • Published 23 days ago • 10

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published 19 days ago • 323

upvoted 2 papers 19 days ago

Vanast: Virtual Try-On with Human Image Animation via Synthetic Triplet Supervision

Paper • 2604.04934 • Published 21 days ago • 45

OpenWorldLib: A Unified Codebase and Definition of Advanced World Models

Paper • 2604.04707 • Published 21 days ago • 203

upvoted 2 papers 20 days ago

VOID: Video Object and Interaction Deletion

Paper • 2604.02296 • Published 25 days ago • 53

LightThinker++: From Reasoning Compression to Memory Management

Paper • 2604.03679 • Published 23 days ago • 37

upvoted a paper 28 days ago

Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models

Paper • 2603.25716 • Published Mar 26 • 156

upvoted 3 papers about 1 month ago

HopChain: Multi-Hop Data Synthesis for Generalizable Vision-Language Reasoning

Paper • 2603.17024 • Published Mar 17 • 109

WorldAgents: Can Foundation Image Models be Agents for 3D World Models?

Paper • 2603.19708 • Published Mar 20 • 13

Generation Models Know Space: Unleashing Implicit 3D Priors for Scene Understanding

Paper • 2603.19235 • Published Mar 19 • 95

upvoted 5 papers about 2 months ago

Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders

Paper • 2603.06569 • Published Mar 6 • 119

LoGeR: Long-Context Geometric Reconstruction with Hybrid Memory

Paper • 2603.03269 • Published Mar 3 • 63

Holi-Spatial: Evolving Video Streams into Holistic 3D Spatial Intelligence

Paper • 2603.07660 • Published Mar 8 • 86

Helios: Real Real-Time Long Video Generation Model

Paper • 2603.04379 • Published Mar 4 • 186

T2S-Bench & Structure-of-Thought: Benchmarking and Prompting Comprehensive Text-to-Structure Reasoning

Paper • 2603.03790 • Published Mar 4 • 121