12 8

Zhu Yiran

zhuyiran86

AI & ML interests

None yet

Recent Activity

upvoted a paper about 16 hours ago

OceanPile: A Large-Scale Multimodal Ocean Corpus for Foundation Models

upvoted a paper 11 days ago

Sessa: Selective State Space Attention

liked a model 21 days ago

GaryYang123/Meme-Qwen-7B-Instruct

View all activity

Organizations

None yet

upvoted a paper about 16 hours ago

OceanPile: A Large-Scale Multimodal Ocean Corpus for Foundation Models

Paper • 2605.00877 • Published 17 days ago • 15

upvoted a paper 11 days ago

Sessa: Selective State Space Attention

Paper • 2604.18580 • Published 21 days ago • 13

liked a model 21 days ago

GaryYang123/Meme-Qwen-7B-Instruct

Updated 14 days ago • 30 • 62

liked a model 26 days ago

tencent/HY-Embodied-0.5

Image-Text-to-Text • 4B • Updated 28 days ago • 2.69k • 905

upvoted a paper 28 days ago

Zero-shot World Models Are Developmentally Efficient Learners

Paper • 2604.10333 • Published Apr 11 • 7

upvoted a paper 29 days ago

Training a Student Expert via Semi-Supervised Foundation Model Distillation

Paper • 2604.03841 • Published Apr 4 • 10

liked a dataset 30 days ago

world-igr-plum/regions

Updated Jun 17, 2025 • 375k • 10

upvoted 2 papers about 1 month ago

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 501

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published Apr 8 • 324

liked a model about 1 month ago

amazon/chronos-bolt-base

Time Series Forecasting • 0.2B • Updated Nov 21, 2025 • 7.38M • 90

upvoted a paper about 1 month ago

DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models

Paper • 2603.26164 • Published Mar 27 • 364

liked a model about 1 month ago

PhoenixHu/grpo_internvl2_5_how2sign_1b_bleu1_rouge_0403_metta

0.9B • Updated Apr 8 • 27 • 1

upvoted a paper about 1 month ago

Distilling Human-Aligned Privacy Sensitivity Assessment from Large Language Models

Paper • 2603.29497 • Published Mar 31 • 6

liked a dataset about 1 month ago

RidheshBhati/Complete_Data_Source_100K_HOURS

Viewer • Updated 14 days ago • 1.96M • 13.5k • 4

upvoted a paper about 1 month ago

CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence

Paper • 2603.28032 • Published Mar 30 • 341

upvoted a paper about 2 months ago

Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

Paper • 2603.04597 • Published Mar 4 • 210

upvoted a paper 2 months ago

From Blind Spots to Gains: Diagnostic-Driven Iterative Training for Large Multimodal Models

Paper • 2602.22859 • Published Feb 26 • 151

liked 2 models 2 months ago

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27, 2025 • 3.63M • • 13.3k

zai-org/GLM-5

Text Generation • 754B • Updated Apr 5 • 194k • • 2.09k

upvoted a paper 3 months ago

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published Feb 23 • 523

Zhu Yiran

AI & ML interests

Recent Activity

Organizations

zhuyiran86's activity