Hugging Face
Xiangyuan Xue
xxyQwQ
4 followers · 1 following
https://xxyqwq.cn/
AI & ML interests
LLM-Based Agents, Multi-Agent Systems, Reinforcement Learning
Recent Activity
Authored a paper about 10 hours ago: StraTA: Incentivizing Agentic Reinforcement Learning with Strategic Trajectory Abstraction
Upvoted a paper 2 days ago: StraTA: Incentivizing Agentic Reinforcement Learning with Strategic Trajectory Abstraction
Updated a collection 20 days ago: StraTA Miscellaneous
xxyQwQ's models (17), sorted by recently updated
xxyQwQ/train-ppo-sciworld-text-qwen2.5-7b • 8B • Updated 20 days ago • 15
xxyQwQ/train-grpo-sciworld-text-qwen2.5-7b • 8B • Updated 24 days ago • 16
xxyQwQ/train-strata-webshop-text-qwen2.5-3b-ultimate-version • 3B • Updated 29 days ago • 10
xxyQwQ/train-strata-webshop-text-qwen2.5-3b-diverse-version • 3B • Updated 29 days ago • 16
xxyQwQ/train-strata-webshop-text-qwen2.5-3b-judgment-version • 3B • Updated 29 days ago • 19
xxyQwQ/train-strata-webshop-text-qwen2.5-3b-vanilla-version • 3B • Updated 29 days ago • 19
xxyQwQ/train-strata-alfworld-text-qwen2.5-3b-diverse-version • 3B • Updated 30 days ago • 17
xxyQwQ/train-strata-alfworld-text-qwen2.5-3b-judgment-version • 3B • Updated 30 days ago • 17
xxyQwQ/train-strata-alfworld-text-qwen2.5-3b-ultimate-version • 3B • Updated 30 days ago • 16
xxyQwQ/train-strata-alfworld-text-qwen2.5-3b-vanilla-version • 3B • Updated 30 days ago • 18
xxyQwQ/train-ppo-webshop-text-qwen2.5-7b • 8B • Updated 30 days ago • 18
xxyQwQ/train-ppo-alfworld-text-qwen2.5-7b • 8B • Updated 30 days ago • 17
xxyQwQ/train-grpo-webshop-text-qwen2.5-7b • 8B • Updated 30 days ago • 19
xxyQwQ/train-grpo-alfworld-text-qwen2.5-7b • 8B • Updated 30 days ago • 16
xxyQwQ/train-strata-sciworld-text-qwen2.5-7b • 8B • Updated about 1 month ago • 4
xxyQwQ/train-strata-webshop-text-qwen2.5-7b • 8B • Updated Mar 26 • 1
xxyQwQ/train-strata-alfworld-text-qwen2.5-7b • 8B • Updated Mar 26 • 1