IFDecorator - a guox18 Collection

guox18 's Collections

IFDecorator

updated Mar 2

Dataset and Models for ''IFDECORATOR: Wrapping Instruction Following Reinforcement Learning with Verifiable Rewards''

guox18/IFDecorator

Preview • Updated Aug 8, 2025 • 217 • 2

Note Datasets
guox18/Qwen2.5-7B-Instruct-IFDecorator

Text Generation • 8B • Updated Aug 8, 2025 • 7
guox18/Llama3.1-8B-Instruct-IFDecorator

8B • Updated Aug 10, 2025
guox18/Qwen3-8B-IFDecorator

8B • Updated Aug 7, 2025 • 1
IFDECORATOR: Wrapping Instruction Following Reinforcement Learning with Verifiable Rewards

Paper • 2508.04632 • Published Aug 6, 2025 • 2