Commit History
Update training/AgentDebuggerEnv_GRPO_Training.ipynb a2cb0a0 verified
Delete HANDOVER.md ff0b4f6 verified
fix: use GitHub raw URLs for images so README renders on HF Space 3eb8edc
shank commited on
Added blog post c02b65b
shank commited on
Added blog post eacdf84
shank commited on
Added readme again 3165754
shank commited on
Added readme f6f33cf
shank commited on
Update: Added final imporvements for hackathon 713f336
shank commited on
chore: clean up local dev files and temporary virtual environments 59986c5
shank commited on
chore: clean up local dev files and temporary virtual environments 374c6cc
shank commited on
Update: Triggering the full run 75cd77b
shank commited on
Fix: batch%num_generations math 2b499e7
shank commited on
Cuda returns false fixed b8172c5
shank commited on
COMPUTE_DRIVE fix 77156dd
shank commited on
Fix: Removed BitsandBytes bdec91d
shank commited on
Fix: Fixed dependancy issues db12eaa
shank commited on
Revert "Fix: Dockerfile" dc7eb3f
shank commited on
Fix: Dockerfile 5dcd156
shank commited on
Fix: Fixed again again accb271
shank commited on
Fix: Fixed again 9864e61
shank commited on
Fix: Fixing Again 6747185
shank commited on
Fix: Fixing 18b4e8a
shank commited on
Fix: Trying to fix dependency issues 024f3c7
shank commited on
Fix: Fixed file cb09ef1
shank commited on
fix: serialize bug_metadata as JSON to fix pyarrow mixed-type error 4668456
shank commited on
fix: upgrade bitsandbytes>=0.49.0 (triton.ops), switch to Qwen2.5-Coder-3B a2fa47a
shank commited on
fix: torch at build time, remove mergekit (conflicts accelerate/peft/trl) 2bfaf77
shank commited on
fix: empty requirements.txt, install training deps at runtime 5d0b2d4
shank commited on
fix: remove wandb - click conflict with gradio causes resolution-too-deep 2005cd2
shank commited on
fix: resolve pip dependency conflicts for HF Spaces build d0d5f60
shank commited on
Fix: loosen strict dependencies to prevent pip backtracking 2e3be87
shank commited on
fix: remove hardcoded torch from requirements for HF space fe04772
shank commited on
chore: normalize dataset inputs and fix mergekit dependency for TRL 0.14.0 e67270e
shank commited on
Add HANDOVER.md: full project state, deps, training instructions, known fixes 97aad17
shank commited on
Auto-detect GPU: bfloat16+batch2+gen8 on A100, float16+batch1+gen4 on T4 — same script works on both ea6fe4e
shank commited on
Reduce max_completion_length to 160 for T4 speed: target 1000 steps in <8hrs 9487853
shank commited on
Fix: bump bitsandbytes to 0.45.3 for CUDA 12.x support on Kaggle T4 6bf2fbb
shank commited on
Optimize for Kaggle P100: float16, batch=1, grad_accum=8, num_gen=4, max_completion=256, lora_r=8 73f957d
shank commited on
Fix GRPOConfig: rename max_new_tokens to max_completion_length for trl==0.14.0 8b16369
shank commited on
Update: Added testing a5c67b3
shank commited on
Align gradio version with Hugging Face Space builder2 633a3b7
shank commited on
Add dockerignore to reduce Space build context c945597
shank commited on
Stabilize Space runtime: pin ML deps and disable runtime package drift 663b8db
shank commited on
Pin torch to cu121 build + use model.device instead of hardcoded cuda string 8f291e0
shank commited on
Replace unsloth with bitsandbytes+peft: fixes CUDA driver incompatibility on HF A100 c325ad7
shank commited on
Fix Gradio 4.x every= deprecation: use gr.Timer for auto-refresh 5eea2dd
shank commited on
Reduce training to 500 steps with tightened curriculum for A10G budget ba8df98
shank commited on
Add Gradio training monitor and fix subprocess python path b37b2eb
shank commited on
Fix eval device selection with CUDA-safe fallback dc8001b
shank commited on