Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Fu's picture
1 5 9

Fu

CocoFans
·

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago
Q-Mask: Query-driven Causal Masks for Text Anchoring in OCR-Oriented Vision-Language Models
upvoted a paper 5 months ago
REVISOR: Beyond Textual Reflection, Towards Multimodal Introspective Reasoning in Long-Form Video Understanding
commentedon a paper 6 months ago
HyperClick: Advancing Reliable GUI Grounding via Uncertainty Calibration
View all activity

Organizations

None yet

upvoted a paper 1 day ago

Q-Mask: Query-driven Causal Masks for Text Anchoring in OCR-Oriented Vision-Language Models

Paper • 2604.00161 • Published 26 days ago • 1
upvoted a paper 5 months ago

REVISOR: Beyond Textual Reflection, Towards Multimodal Introspective Reasoning in Long-Form Video Understanding

Paper • 2511.13026 • Published Nov 17, 2025 • 26
upvoted 2 papers 6 months ago

Q-Frame: Query-aware Frame Selection and Multi-Resolution Adaptation for Video-LLMs

Paper • 2506.22139 • Published Jun 27, 2025 • 2

HyperClick: Advancing Reliable GUI Grounding via Uncertainty Calibration

Paper • 2510.27266 • Published Oct 31, 2025 • 21
upvoted a paper 7 months ago

BTL-UI: Blink-Think-Link Reasoning Model for GUI Agent

Paper • 2509.15566 • Published Sep 19, 2025 • 14
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs