Article FineVideo: behind the scenes +4 mfarre, andito, lewtun, lvwerra, pcuenq, thomwolf • Sep 23, 2024 • 35
Multimodal Chaptering for Long-Form TV Newscast Video Paper • 2406.17590 • Published Mar 20, 2024 • 2
Moments Lab Research papers Collection All of Moments Lab Research papers available on Hugging Face • 3 items • Updated Sep 2, 2024 • 1
Towards Retrieval Augmented Generation over Large Video Libraries Paper • 2406.14938 • Published Jun 21, 2024 • 22
Article PaliGemma - Google's Cutting-Edge Open Vision Language Model +1 merve, andsteing, pcuenq • May 14, 2024 • 287
Inserting Faces inside Captions: Image Captioning with Attention Guided Merging Paper • 2405.02305 • Published Mar 20, 2024 • 2
Article Welcome Llama 3 - Meta's new open LLM +3 philschmid, osanseviero, pcuenq, ybelkada, lvwerra • Apr 18, 2024 • 295