13 6

guoguoc PRO

woshichaoren123

AI & ML interests

None yet

Recent Activity

liked a dataset 10 days ago

how2everything/how2bench

upvoted a paper 13 days ago

Video Analysis and Generation via a Semantic Progress Function

updated a dataset 16 days ago

woshichaoren123/vis_data_0424_data

View all activity

Organizations

None yet

liked a dataset 10 days ago

how2everything/how2bench

Viewer • Updated Feb 9 • 7k • 54 • 2

upvoted a paper 13 days ago

Video Analysis and Generation via a Semantic Progress Function

Paper • 2604.22554 • Published 17 days ago • 63

updated a dataset 16 days ago

woshichaoren123/vis_data_0424_data

Updated 16 days ago • 54

published a dataset 16 days ago

woshichaoren123/vis_data_0424_data

Updated 16 days ago • 54

published a Space 16 days ago

Vis Data 0424

🏆

Generate a personalized greeting from a name

upvoted a paper 17 days ago

Seeing Fast and Slow: Learning the Flow of Time in Videos

Paper • 2604.21931 • Published 18 days ago • 19

updated a dataset 20 days ago

woshichaoren123/egoplan_video

Updated 20 days ago • 60

published a dataset 20 days ago

woshichaoren123/egoplan_video

Updated 20 days ago • 60

upvoted a paper 24 days ago

HiVLA: A Visual-Grounded-Centric Hierarchical Embodied Manipulation System

Paper • 2604.14125 • Published 26 days ago • 21

upvoted a paper about 1 month ago

HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents

Paper • 2604.07430 • Published Apr 8 • 187

updated a dataset about 1 month ago

woshichaoren123/text

Viewer • Updated Apr 7 • 2.69M • 4

published a dataset about 1 month ago

woshichaoren123/text

Viewer • Updated Apr 7 • 2.69M • 4

updated a Space about 1 month ago

Test

💬

Locate objects in images and videos

upvoted 3 papers about 2 months ago

WorldAgents: Can Foundation Image Models be Agents for 3D World Models?

Paper • 2603.19708 • Published Mar 20 • 13

3DreamBooth: High-Fidelity 3D Subject-Driven Video Generation Model

Paper • 2603.18524 • Published Mar 19 • 58

Look Before Acting: Enhancing Vision Foundation Representations for Vision-Language-Action Models

Paper • 2603.15618 • Published Mar 16 • 21

published a Space about 2 months ago

Test

💬

Locate objects in images and videos

upvoted a paper about 2 months ago

VideoAgentTrek: Computer Use Pretraining from Unlabeled Videos

Paper • 2510.19488 • Published Oct 22, 2025 • 21

guoguoc PRO

AI & ML interests

Recent Activity

Organizations

woshichaoren123's activity

Vis Data 0424

Test

Test