5 11 84

Wenbo Hu

gordonhu

https://gordonhu608.github.io/

AI & ML interests

None yet

Recent Activity

liked a dataset 3 days ago

OpenGVLab/ScaleCUA-Data

liked a dataset 3 days ago

mvp-lab/LLaVA-OneVision-2-Data

liked a dataset 4 days ago

microsoft/synthetic-computers-at-scale

View all activity

Organizations

liked 2 datasets 3 days ago

OpenGVLab/ScaleCUA-Data

Preview • Updated Sep 27, 2025 • 6.17k • 31

mvp-lab/LLaVA-OneVision-2-Data

Viewer • Updated 19 days ago • 24 • 205k • 24

liked 2 datasets 4 days ago

microsoft/synthetic-computers-at-scale

Viewer • Updated 29 days ago • 98 • 2.12k • 18

xlangai/AgentNet

Preview • Updated Jan 9 • 2.48k • 82

liked a model 4 days ago

Hcompany/Holo3-35B-A3B

Image-Text-to-Text • 35B • Updated Apr 2 • 52.1k • 346

liked a dataset 5 days ago

Qwen/WebWorldData

Viewer • Updated 22 days ago • 463k • 1.33k • 64

liked a model 26 days ago

Qwen/Qwen3.5-35B-A3B

Image-Text-to-Text • 36B • Updated Apr 24 • 2.96M • • 1.44k

liked 2 models about 1 month ago

MiniMaxAI/MiniMax-M2.7

Text Generation • 229B • Updated Apr 20 • 1.51M • • 1.16k

deepseek-ai/DeepSeek-V4-Pro

Text Generation • 862B • Updated 24 days ago • 5.92M • • 4.45k

liked a dataset about 1 month ago

HuggingFaceM4/FineVision

Viewer • Updated Oct 21, 2025 • 24.2M • 155k • 486

upvoted 2 papers about 1 month ago

Interleaving Reasoning for Better Text-to-Image Generation

Paper • 2509.06945 • Published Sep 8, 2025 • 16

ARES: Multimodal Adaptive Reasoning via Difficulty-Aware Token-Level Entropy Shaping

Paper • 2510.08457 • Published Oct 9, 2025 • 14

updated a model about 1 month ago

InternRobotics/G2VLM-Qwen2-VL-2B

Image-Text-to-Text • 2B • Updated Apr 18 • 10 • 2

liked a model about 1 month ago

InternRobotics/G2VLM-Qwen2-VL-2B

Image-Text-to-Text • 2B • Updated Apr 18 • 10 • 2

published a model about 1 month ago

InternRobotics/G2VLM-Qwen2-VL-2B

Image-Text-to-Text • 2B • Updated Apr 18 • 10 • 2

liked 2 datasets about 2 months ago

PolarSeeker/OpenSeeker-v1-Data

Viewer • Updated Mar 17 • 11.7k • 1.77k • 44

zlab-princeton/Vero-600k

Viewer • Updated Apr 13 • 606k • 67.3k • 32

authored 3 papers about 2 months ago

BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual Questions

Paper • 2308.09936 • Published Aug 19, 2023 • 1

Matryoshka Query Transformer for Large Vision-Language Models

Paper • 2405.19315 • Published May 29, 2024 • 1

MRAG-Bench: Vision-Centric Evaluation for Retrieval-Augmented Multimodal Models

Paper • 2410.08182 • Published Oct 10, 2024

Wenbo Hu

AI & ML interests

Recent Activity

Organizations

gordonhu's activity