wuyuhan

yuhanwuuu

https://wuyuhan3z.github.io/

AI & ML interests

None yet

Recent Activity

liked a model 18 days ago

deepseek-ai/DeepSeek-V4-Pro

liked a model 9 months ago

deepseek-ai/DeepSeek-V3.1

liked a model 9 months ago

ByteDance-Seed/Seed-OSS-36B-Instruct

liked a model 18 days ago

deepseek-ai/DeepSeek-V4-Pro

Text Generation • 862B • Updated 6 days ago • 2.02M • 3.87k

liked 4 models 9 months ago

liked a dataset 9 months ago

ByteDance-Seed/WideSearch

Viewer • Updated Sep 8, 2025 • 200 • 15.6k • 41

liked 3 datasets 11 months ago

AIM-Harvard/MedBrowseComp

Viewer • Updated Aug 27, 2025 • 1.14k • 335 • 8

xbench/ScienceQA

Viewer • Updated Jun 18, 2025 • 100 • 35 • 8

R-Bench/R-Bench

Viewer • Updated May 27, 2025 • 3.52k • 5.75k • 22

liked a model 12 months ago

deepseek-ai/DeepSeek-R1-0528

Text Generation • 685B • Updated May 29, 2025 • 1.73M • 2.45k

reacted to merve's post with 🔥 12 months ago

what happened in open AI past week? so many vision LM & omni releases 🔥 merve/releases-23-may-68343cb970bbc359f9b5fb05

multimodal 💬🖼️
> new moondream (VLM) is out: it's 4-bit quantized (with QAT) version of moondream-2b, runs on 2.5GB VRAM at 184 tps with only 0.6% drop in accuracy (OS) 🌚
> ByteDance released BAGEL-7B, an omni model that understands and generates both image + text. they also released Dolphin, a document parsing VLM 🐬 (OS)
> Google DeepMind dropped MedGemma at I/O, a VLM that can interpret medical scans, and Gemma 3n, an omni model with competitive LLM performance
> MMaDA is a new 8B diffusion language model that can generate image and text

LLMs
> Mistral released Devstral, a 24B coding assistant (OS) 👩🏻‍💻
> Fairy R1-32B is a new reasoning model -- distilled version of DeepSeek-R1-Distill-Qwen-32B (OS)
> NVIDIA released AceReason-Nemotron-14B, a new 14B math and code reasoning model
> sarvam-m is a new Indic LM with hybrid thinking mode, based on Mistral Small (OS)
> samhitika-0.0.1 is a new Sanskrit corpus (BookCorpus translated with Gemma3-27B)

image generation 🎨
> MTVCrafter is a new human motion animation generator