arxiv:2505.19731
Kashif Rasul
kashif
AI & ML interests
Time Series Forecasting, Denoising Diffusion, Generative Modeling, Reinforcement Learning
Recent Activity
new activity about 4 hours ago
HuggingFaceH4/on-policy-distillation: Add byte-offset alignment update and canonical URL
updated a Space about 4 hours ago
HuggingFaceH4/on-policy-distillation
upvoted a paper 2 days ago
Accelerating RL for LLM Reasoning with Optimal Advantage Regression