Spaces:

agentDebugger
/

AgentDebugger-training-v3

Running

App Files Files Community

AgentDebugger-training-v3

Commit History

Update README.md

8f19095

Running
verified

shashaank0707 commited on 3 days ago

Update training/AgentDebuggerEnv_GRPO_Training.ipynb

a2cb0a0
verified

shashaank0707 commited on 5 days ago

Delete HANDOVER.md

ff0b4f6
verified

shashaank0707 commited on 5 days ago

fix: use GitHub raw URLs for images so README renders on HF Space

3eb8edc

shank commited on 5 days ago

Added blog post

c02b65b

shank commited on 5 days ago

Added blog post

eacdf84

shank commited on 5 days ago

Added readme again

3165754

shank commited on 5 days ago

Added readme

f6f33cf

shank commited on 5 days ago

Update: Added final imporvements for hackathon

713f336

shank commited on 5 days ago

chore: clean up local dev files and temporary virtual environments

59986c5

shank commited on 5 days ago

chore: clean up local dev files and temporary virtual environments

374c6cc

shank commited on 5 days ago

Update: Triggering the full run

75cd77b

shank commited on 6 days ago

Fix: batch%num_generations math

2b499e7

shank commited on 6 days ago

Cuda returns false fixed

b8172c5

shank commited on 6 days ago

COMPUTE_DRIVE fix

77156dd

shank commited on 6 days ago

Fix: Removed BitsandBytes

bdec91d

shank commited on 6 days ago

Fix: Fixed dependancy issues

db12eaa

shank commited on 6 days ago

Revert "Fix: Dockerfile"

dc7eb3f

shank commited on 6 days ago

Fix: Dockerfile

5dcd156

shank commited on 6 days ago

Fix: Fixed again again

accb271

shank commited on 6 days ago

Fix: Fixed again

9864e61

shank commited on 6 days ago

Fix: Fixing Again

6747185

shank commited on 6 days ago

Fix: Fixing

18b4e8a

shank commited on 6 days ago

Fix: Trying to fix dependency issues

024f3c7

shank commited on 6 days ago

Fix: Fixed file

cb09ef1

shank commited on 6 days ago

fix: serialize bug_metadata as JSON to fix pyarrow mixed-type error

4668456

shank commited on 6 days ago

fix: upgrade bitsandbytes>=0.49.0 (triton.ops), switch to Qwen2.5-Coder-3B

a2fa47a

shank commited on 6 days ago

fix: torch at build time, remove mergekit (conflicts accelerate/peft/trl)

2bfaf77

shank commited on 6 days ago

fix: empty requirements.txt, install training deps at runtime

5d0b2d4

shank commited on 6 days ago

fix: remove wandb - click conflict with gradio causes resolution-too-deep

2005cd2

shank commited on 6 days ago

fix: resolve pip dependency conflicts for HF Spaces build

d0d5f60

shank commited on 6 days ago

Fix: loosen strict dependencies to prevent pip backtracking

2e3be87

shank commited on 6 days ago

fix: remove hardcoded torch from requirements for HF space

fe04772

shank commited on 6 days ago

chore: normalize dataset inputs and fix mergekit dependency for TRL 0.14.0

e67270e

shank commited on 6 days ago

Add HANDOVER.md: full project state, deps, training instructions, known fixes

97aad17

shank commited on 6 days ago

Auto-detect GPU: bfloat16+batch2+gen8 on A100, float16+batch1+gen4 on T4 — same script works on both

ea6fe4e

shank commited on 6 days ago

Reduce max_completion_length to 160 for T4 speed: target 1000 steps in <8hrs

9487853

shank commited on 6 days ago

Fix: bump bitsandbytes to 0.45.3 for CUDA 12.x support on Kaggle T4

6bf2fbb

shank commited on 6 days ago

Optimize for Kaggle P100: float16, batch=1, grad_accum=8, num_gen=4, max_completion=256, lora_r=8

73f957d

shank commited on 6 days ago

Fix GRPOConfig: rename max_new_tokens to max_completion_length for trl==0.14.0

8b16369

shank commited on 6 days ago

Update: Added testing

a5c67b3

shank commited on 6 days ago

Align gradio version with Hugging Face Space builder2

633a3b7

shank commited on 6 days ago

Add dockerignore to reduce Space build context

c945597

shank commited on 6 days ago

Stabilize Space runtime: pin ML deps and disable runtime package drift

663b8db

shank commited on 6 days ago

Pin torch to cu121 build + use model.device instead of hardcoded cuda string

8f291e0

shank commited on 6 days ago

Replace unsloth with bitsandbytes+peft: fixes CUDA driver incompatibility on HF A100

c325ad7

shank commited on 6 days ago

Fix Gradio 4.x every= deprecation: use gr.Timer for auto-refresh

5eea2dd

shank commited on 6 days ago

Reduce training to 500 steps with tightened curriculum for A10G budget

ba8df98

shank commited on 6 days ago

Add Gradio training monitor and fix subprocess python path

b37b2eb

shank commited on 6 days ago

Fix eval device selection with CUDA-safe fallback

dc8001b

shank commited on 6 days ago

Commit History

Update README.md 8f19095 Running verified

Update training/AgentDebuggerEnv_GRPO_Training.ipynb a2cb0a0 verified

Delete HANDOVER.md ff0b4f6 verified

fix: use GitHub raw URLs for images so README renders on HF Space 3eb8edc

Added blog post c02b65b

Added blog post eacdf84

Added readme again 3165754

Added readme f6f33cf

Update: Added final imporvements for hackathon 713f336

chore: clean up local dev files and temporary virtual environments 59986c5

chore: clean up local dev files and temporary virtual environments 374c6cc

Update: Triggering the full run 75cd77b

Fix: batch%num_generations math 2b499e7

Cuda returns false fixed b8172c5

COMPUTE_DRIVE fix 77156dd

Fix: Removed BitsandBytes bdec91d

Fix: Fixed dependancy issues db12eaa

Revert "Fix: Dockerfile" dc7eb3f

Fix: Dockerfile 5dcd156

Fix: Fixed again again accb271

Fix: Fixed again 9864e61

Fix: Fixing Again 6747185

Fix: Fixing 18b4e8a

Fix: Trying to fix dependency issues 024f3c7

Fix: Fixed file cb09ef1

fix: serialize bug_metadata as JSON to fix pyarrow mixed-type error 4668456

fix: upgrade bitsandbytes>=0.49.0 (triton.ops), switch to Qwen2.5-Coder-3B a2fa47a

fix: torch at build time, remove mergekit (conflicts accelerate/peft/trl) 2bfaf77

fix: empty requirements.txt, install training deps at runtime 5d0b2d4

fix: remove wandb - click conflict with gradio causes resolution-too-deep 2005cd2

fix: resolve pip dependency conflicts for HF Spaces build d0d5f60

Fix: loosen strict dependencies to prevent pip backtracking 2e3be87

fix: remove hardcoded torch from requirements for HF space fe04772

chore: normalize dataset inputs and fix mergekit dependency for TRL 0.14.0 e67270e

Add HANDOVER.md: full project state, deps, training instructions, known fixes 97aad17

Auto-detect GPU: bfloat16+batch2+gen8 on A100, float16+batch1+gen4 on T4 — same script works on both ea6fe4e

Reduce max_completion_length to 160 for T4 speed: target 1000 steps in <8hrs 9487853

Fix: bump bitsandbytes to 0.45.3 for CUDA 12.x support on Kaggle T4 6bf2fbb

Optimize for Kaggle P100: float16, batch=1, grad_accum=8, num_gen=4, max_completion=256, lora_r=8 73f957d

Fix GRPOConfig: rename max_new_tokens to max_completion_length for trl==0.14.0 8b16369

Update: Added testing a5c67b3

Align gradio version with Hugging Face Space builder2 633a3b7

Add dockerignore to reduce Space build context c945597

Stabilize Space runtime: pin ML deps and disable runtime package drift 663b8db

Pin torch to cu121 build + use model.device instead of hardcoded cuda string 8f291e0

Replace unsloth with bitsandbytes+peft: fixes CUDA driver incompatibility on HF A100 c325ad7

Fix Gradio 4.x every= deprecation: use gr.Timer for auto-refresh 5eea2dd

Reduce training to 500 steps with tightened curriculum for A10G budget ba8df98

Add Gradio training monitor and fix subprocess python path b37b2eb

Fix eval device selection with CUDA-safe fallback dc8001b

Update README.md

8f19095

Running
verified

Update training/AgentDebuggerEnv_GRPO_Training.ipynb

a2cb0a0
verified

Delete HANDOVER.md

ff0b4f6
verified

fix: use GitHub raw URLs for images so README renders on HF Space

3eb8edc

Added blog post

c02b65b

Added blog post

eacdf84

Added readme again

3165754

Added readme

f6f33cf

Update: Added final imporvements for hackathon

713f336

chore: clean up local dev files and temporary virtual environments

59986c5

chore: clean up local dev files and temporary virtual environments

374c6cc

Update: Triggering the full run

75cd77b

Fix: batch%num_generations math

2b499e7

Cuda returns false fixed

b8172c5

COMPUTE_DRIVE fix

77156dd

Fix: Removed BitsandBytes

bdec91d

Fix: Fixed dependancy issues

db12eaa

Revert "Fix: Dockerfile"

dc7eb3f

Fix: Dockerfile

5dcd156

Fix: Fixed again again

accb271

Fix: Fixed again

9864e61

Fix: Fixing Again

6747185

Fix: Fixing

18b4e8a

Fix: Trying to fix dependency issues

024f3c7

Fix: Fixed file

cb09ef1

fix: serialize bug_metadata as JSON to fix pyarrow mixed-type error

4668456

fix: upgrade bitsandbytes>=0.49.0 (triton.ops), switch to Qwen2.5-Coder-3B

a2fa47a

fix: torch at build time, remove mergekit (conflicts accelerate/peft/trl)

2bfaf77

fix: empty requirements.txt, install training deps at runtime

5d0b2d4

fix: remove wandb - click conflict with gradio causes resolution-too-deep

2005cd2

fix: resolve pip dependency conflicts for HF Spaces build

d0d5f60

Fix: loosen strict dependencies to prevent pip backtracking

2e3be87

fix: remove hardcoded torch from requirements for HF space

fe04772

chore: normalize dataset inputs and fix mergekit dependency for TRL 0.14.0

e67270e

Add HANDOVER.md: full project state, deps, training instructions, known fixes

97aad17

Auto-detect GPU: bfloat16+batch2+gen8 on A100, float16+batch1+gen4 on T4 — same script works on both

ea6fe4e

Reduce max_completion_length to 160 for T4 speed: target 1000 steps in <8hrs

9487853

Fix: bump bitsandbytes to 0.45.3 for CUDA 12.x support on Kaggle T4

6bf2fbb

Optimize for Kaggle P100: float16, batch=1, grad_accum=8, num_gen=4, max_completion=256, lora_r=8

73f957d

Fix GRPOConfig: rename max_new_tokens to max_completion_length for trl==0.14.0

8b16369

Update: Added testing

a5c67b3

Align gradio version with Hugging Face Space builder2

633a3b7

Add dockerignore to reduce Space build context

c945597

Stabilize Space runtime: pin ML deps and disable runtime package drift

663b8db

Pin torch to cu121 build + use model.device instead of hardcoded cuda string

8f291e0

Replace unsloth with bitsandbytes+peft: fixes CUDA driver incompatibility on HF A100

c325ad7

Fix Gradio 4.x every= deprecation: use gr.Timer for auto-refresh

5eea2dd

Reduce training to 500 steps with tightened curriculum for A10G budget

ba8df98

Add Gradio training monitor and fix subprocess python path

b37b2eb

Fix eval device selection with CUDA-safe fallback

dc8001b