Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
1
12
Chongyu Fan
a-F1
Follow
flyingbugs's profile picture
hyoseo's profile picture
wcs2024's profile picture
3 followers
·
7 following
https://chongyu-fan.netlify.app/
a-F1
chongyu-fan-408a0126a
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 1 month ago
Rethinking Muon Beyond Pretraining: Spectral Failures and High-Pass Remedies for VLA and RLVR
submitted
a paper
about 1 month ago
Rethinking Muon Beyond Pretraining: Spectral Failures and High-Pass Remedies for VLA and RLVR
updated
a model
7 months ago
OPTML-Group/TOFU-origin-Llama-2-7b-chat
View all activity
Organizations
a-F1
's models
206
Sort: Recently updated
a-F1/Qwen2.5-1.5B-Instruct-SFT-OpenR1-LR_2e-5-Epoch_2
Text Generation
•
2B
•
Updated
Mar 5, 2025
•
2
a-F1/Qwen2.5-1.5B-Instruct-SFT-OpenR1-LR_1e-4-Epoch_2
Text Generation
•
2B
•
Updated
Mar 5, 2025
•
2
a-F1/try
Updated
Mar 5, 2025
a-F1/Qwen2.5-1.5B-Instruct-SFT-OpenR1-LR_2e-5
Text Generation
•
2B
•
Updated
Mar 3, 2025
•
1
a-F1/Qwen2.5-1.5B-Instruct-SFT-OpenR1-LR_5e-5
Text Generation
•
2B
•
Updated
Mar 3, 2025
•
2
a-F1/Qwen2.5-1.5B-Instruct-SFT-OpenR1-LR_1e-5
Text Generation
•
2B
•
Updated
Mar 3, 2025
•
2
a-F1/Qwen2.5-1.5B-Instruct-SFT-OpenR1-LR_5e-6
Text Generation
•
2B
•
Updated
Mar 3, 2025
•
2
a-F1/Qwen-1.5B-SFT-OpenR1-LR_2e-5
Text Generation
•
2B
•
Updated
Mar 2, 2025
•
2
a-F1/Qwen-1.5B-SFT-OpenR1-LR_5e-5
Text Generation
•
2B
•
Updated
Mar 2, 2025
•
3
a-F1/Qwen-1.5B-SFT-OpenR1-LR_1e-5
Text Generation
•
2B
•
Updated
Mar 2, 2025
•
2
a-F1/Qwen-1.5B-SFT-OpenR1-LR_5e-6
Text Generation
•
2B
•
Updated
Mar 2, 2025
•
2
a-F1/Qwen-1.5B-SFT-OpenR1
2B
•
Updated
Mar 2, 2025
•
5
a-F1/Qwen-7B-SFT-S1
Text Generation
•
8B
•
Updated
Mar 1, 2025
•
6
•
1
a-F1/DeepSeek-7B-SFT-S1
8B
•
Updated
Feb 28, 2025
•
2
a-F1/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
2B
•
Updated
Feb 27, 2025
•
3
a-F1/Qwen2.5-Math-1.5B-Open-R1-Distill-bi
Text Generation
•
2B
•
Updated
Feb 24, 2025
•
7
a-F1/Qwen2.5-Math-1.5B-Open-R1-Distill-mixed
Text Generation
•
2B
•
Updated
Feb 24, 2025
•
3
a-F1/Qwen2.5-Math-1.5B-Open-R1-Distill
Text Generation
•
2B
•
Updated
Feb 24, 2025
•
7
a-F1/Qwen2.5-1.5B-Open-R1-Distill-bi
Text Generation
•
2B
•
Updated
Feb 24, 2025
•
4
a-F1/Qwen2.5-1.5B-Open-R1-Distill-mixed
Text Generation
•
2B
•
Updated
Feb 23, 2025
•
5
a-F1/Qwen2.5-7B-Open-R1-Distill-mixed
Updated
Feb 21, 2025
a-F1/Qwen2.5-7B-Open-R1-Distill-bi
Text Generation
•
8B
•
Updated
Feb 21, 2025
•
4
a-F1/SimNPO_MUSE_News
Text Generation
•
7B
•
Updated
Oct 24, 2024
•
4
a-F1/SimNPO_MUSE_Books
Text Generation
•
7B
•
Updated
Oct 24, 2024
•
4
a-F1/SimNPO_TOFU_Forget10
Text Generation
•
7B
•
Updated
Oct 24, 2024
•
3
a-F1/SimNPO_TOFU_Forget05
Text Generation
•
7B
•
Updated
Oct 24, 2024
•
4
Previous
1
...
5
6
7
Next