Reinforcement Learning
Transformers
English
post-training
distillation
agentic-coding
composer-2.5
cursor
kimi-k2
grpo
dapo
diloco
openenv
trl
verl
research
methodology
Instructions to use Codeseys/composer-replication-framework with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use Codeseys/composer-replication-framework with Transformers:
    # Load model directly
    from transformers import AutoModel
    model = AutoModel.from_pretrained("Codeseys/composer-replication-framework", dtype="auto")
- Notebooks
- Google Colab
- Kaggle
Commit History
Wave 12: close V1-V8 brief — GPU smoke, SDPO firing, real-trace e2e d88715c
Wave 11: cross-model adversarial review + honest down-revision f16fa23
Spike 007: include synthetic_session.jsonl fixture in repo a35a8d7
Wave 7+8+9: spikes 006/007/008 — close vision-validation gaps V2/V5/V8 57af35d
Wave 4: data collator + loss composition smoke (38/38 tests pass) 157cdba
baladithyab committed
Wave 3: integration architecture + spike-005 trainer skeleton (16 tests pass) fd77f74
baladithyab committed
Integrate Cursor blog directly + audit research note + add SDPO/OPSD link 1cede23
baladithyab committed