8 16

Joseph Sanchez

wu-wenhao19

AI & ML interests

None yet

Recent Activity

liked a model about 22 hours ago

tencent/Hy-MT2-1.8B

liked a dataset about 23 hours ago

fka/prompts.chat

liked a model 1 day ago

mistralai/Mistral-7B-Instruct-v0.3

View all activity

Organizations

None yet

liked a model about 22 hours ago

tencent/Hy-MT2-1.8B

Translation • 2B • Updated about 23 hours ago • 564 • 281

liked a dataset about 23 hours ago

fka/prompts.chat

Viewer • Updated about 24 hours ago • 1.83k • 50.7k • 9.7k

liked a model 1 day ago

mistralai/Mistral-7B-Instruct-v0.3

7B • Updated Dec 3, 2025 • 4.47M • 2.59k

liked a model 5 days ago

PeterPanonly/Qwen2.5-VL-3B-Instruct-Thinking-SubQ

Updated 4 days ago • 1

upvoted a paper 5 days ago

Learning to Foresee: Unveiling the Unlocking Efficiency of On-Policy Distillation

Paper • 2605.11739 • Published 10 days ago • 55

upvoted a paper 7 days ago

CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence

Paper • 2605.12882 • Published 10 days ago • 261

liked a model 9 days ago

FacebookAI/xlm-roberta-base

Fill-Mask • 0.3B • Updated Feb 19, 2024 • 22.1M • • 832

upvoted 2 papers 12 days ago

SymptomAI: Towards a Conversational AI Agent for Everyday Symptom Assessment

Paper • 2605.04012 • Published 18 days ago • 11

Mean Mode Screaming: Mean--Variance Split Residuals for 1000-Layer Diffusion Transformers

Paper • 2605.06169 • Published 16 days ago • 186

liked a dataset 16 days ago

jat-project/jat-dataset-tokenized

Viewer • Updated Dec 22, 2023 • 32M • 746k • 7

upvoted a paper 22 days ago

Heterogeneous Scientific Foundation Model Collaboration

Paper • 2604.27351 • Published 23 days ago • 217

liked a model 29 days ago

rroshann/sec-sentiment-sftgrpo-deepseek-14b

Text Generation • 15B • Updated 29 days ago • 327 • • 1

liked a model 30 days ago

kmseong/llama2_7b_base-gsm8k_lora_ft_lr3e-5

7B • Updated 30 days ago • 30 • 1

upvoted a paper 30 days ago

LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model

Paper • 2604.20796 • Published Apr 22 • 240

liked 2 models about 1 month ago

tencent/HY-World-2.0

Image-to-3D • Updated 2 days ago • 2.7k • 655

tencent/HY-Embodied-0.5

Image-Text-to-Text • 4B • Updated Apr 14 • 986 • 906

upvoted a paper about 1 month ago

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published Apr 8 • 325

liked a model about 1 month ago

Qwen/Qwen2.5-1.5B-Instruct

Text Generation • 2B • Updated Sep 25, 2024 • 14.3M • • 704

liked a dataset about 1 month ago

rainbowrobotics/simtos_one_item_0409_rel

Viewer • Updated Apr 12 • 296k • 66 • 1

liked a model about 1 month ago

hector-gr/RLCR-v4-ks-uniqueness-cov0-entropy100-noece-noaurc-scaletrue-batchcov0only-cold-math

Text Generation • 8B • Updated Apr 10 • 8 • 1

Joseph Sanchez

AI & ML interests

Recent Activity

Organizations

wu-wenhao19's activity