Rin's picture

Rin

hu5enpai

·

AI & ML interests

None yet

Organizations

New activity in deepseek-ai/DeepSeek-OCR-2 4 months ago

ms-swift has supported inference, deployment, and fine-tuning of the DeepSeek-OCR-2 model.

#5 opened 4 months ago by

commented a paper 6 months ago

Soft Adaptive Policy Optimization

Paper • 2511.20347 • Published Nov 25, 2025 • 43 •

New activity in PaddlePaddle/PaddleOCR-VL 7 months ago

ms-swift has supported inference, deployment, and fine-tuning of the PaddleOCR-VL model.

#42 opened 7 months ago by

commented a paper 8 months ago

On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised Fine-Tuning and Reinforcement Learning via Dynamic Weighting

Paper • 2508.11408 • Published Aug 15, 2025 • 8 •

commented a paper 9 months ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published Aug 7, 2025 • 189 •

commented 2 papers 10 months ago

DeepEyes: Incentivizing "Thinking with Images" via Reinforcement Learning

Paper • 2505.14362 • Published May 20, 2025 • 5 •

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2, 2025 • 190 •

New activity in Qwen/Qwen3-Coder-480B-A35B-Instruct 10 months ago

👍👍

#19 opened 10 months ago by

New activity in ChenShawn/DeepEyes-Datasets-47k 10 months ago

Unable to load the dataset

#2 opened 10 months ago by

New activity in microsoft/Florence-2-large-ft almost 2 years ago

Swift now supports inference, training, and deployment of the Florence models.

#14 opened almost 2 years ago by

New activity in microsoft/Florence-2-large almost 2 years ago

How to Finetune?

#19 opened almost 2 years ago by

Fix incorrect bos_token, eos_token, and pad_token ids in config.json

#17 opened almost 2 years ago by

New activity in liuhaotian/LLaVA-Instruct-150K almost 2 years ago

Unable to load dataset.

#10 opened over 2 years ago by

New activity in OpenGVLab/InternVL-Chat-V1-5 about 2 years ago

Swift now supports inference, training of InternVL-Chat-V1-5

#11 opened about 2 years ago by