along's picture

along

oncealong

·

AI & ML interests

None yet

Organizations

None yet

upvoted 4 articles 9 months ago

Article

SmolVLM - small yet mighty Vision Language Model

+3

andito, merve, mfarre, eliebak, pcuenq

•

Nov 26, 2024

• 418

Article

SmolVLM2: Bringing Video Understanding to Every Device

+5

orrzohar, mfarre, andito, merve, pcuenq, cyrilzakka, Xenova

•

Feb 20, 2025

• 340

Article

Vision Language Models Explained

merve, edbeeching

•

Apr 11, 2024

• 536

Article

PaliGemma – Google's Cutting-Edge Open Vision Language Model

+1

merve, andsteing, pcuenq

•

May 14, 2024

• 287

upvoted a collection 9 months ago

Qwen2.5-Omni

End-to-End Omni (text, audio, image, video, and natural speech interaction) model based Qwen2.5 • 6 items • Updated Mar 2 • 168

upvoted an article 9 months ago

Article

nanoVLM: The simplest repository to train your VLM in pure PyTorch

+5

ariG23498, lusxvr, andito, sergiopaniego, merve, pcuenq, reach-vb

•

May 21, 2025

• 258

upvoted a paper about 1 year ago

OmniSVG: A Unified Scalable Vector Graphics Generation Model

Paper • 2504.06263 • Published Apr 8, 2025 • 186