docs: update reward signal documentation with structured tables and weight breakdowns c829526 Navigam committed on Apr 26
feat: enhance training notebook for Colab compatibility and path management 5ee1234 Navigam committed on Apr 26
feat: add SFT loss comparison visualization and training notebook for Qwen models 229a9b5 Navigam committed on Apr 26
feat: add T4-optimized SFT and RLVR training scripts, evaluation utilities, and updated documentation 58916ea Navigam committed on Apr 26
docs: update README documentation, add architecture diagram, and include a new training notebook 251b189 Navigam committed on Apr 26
feat: add training pipeline with SFT and RLVR support for Qwen 2.5-3B-Instruct 5c8287c Navigam committed on Apr 26
feat: add FastAPI entrypoint for CORP-ENV web interface and playground 978c5b4 Navigam committed on Apr 26
feat: add evaluation and training logs for Qwen 2.5-7B and DeepSeek 14B models 7e745a8 Navigam committed on Apr 26
feat: add notebook for Qwen 2.5-7B SFT and RLVR training on CORP-ENV 01fd1dc Navigam committed on Apr 26
feat: add DeepSeek 14B evaluation jobs and refactor eval_genral.py to use standard transformers instead of unsloth 19d0b7e Navigam committed on Apr 26
feat: add evaluation framework and Hugging Face job submission script for policy testing a570bcc Navigam committed on Apr 26
feat: enhance training environment and documentation for CORP-ENV models abaaa50 Navigam committed on Apr 26
chore: increase max prompt and sequence lengths in training scripts c0d85b8 Navigam committed on Apr 26
fix: update training scripts to include package version requirements de9bd77 Navigam committed on Apr 26
feat: add new training jobs and scripts for DeepSeek and Nemotron models f0688bd Navigam committed on Apr 26
refactor: update JSONL files for e1 launch readiness and h1 acquisition defense scenarios a4eda5c Navigam committed on Apr 25
refactor: update training scripts and environment setup for Qwen3 model ef0aeea Navigam committed on Apr 25
refactor: update training scripts and documentation for SFT and RLVR processes 4e1a75b Navigam committed on Apr 25
feat: add environment setup and requirements for Lightning AI H100 training b737c1e Navigam committed on Apr 25
feat: update README and runbook for SFT and GRPO training enhancements 6b13adb Navigam committed on Apr 25
feat: add summary generation and visualization for model evaluation results 368fe4f Navigam committed on Apr 25
feat: update evaluation results and training scripts for Qwen2.5-7B-Instruct 6e2b9c3 Navigam committed on Apr 25
feat: update training scripts and add judge-rerunnable notebook for CORP-ENV 97b9312 Navigam committed on Apr 25
feat: add new task definitions and data files for launch readiness scenarios 2a98962 Navigam committed on Apr 25
feat: enhance agent functionality and memory management in corporate environment febe155 Navigam committed on Apr 25
chore: update openenv.yaml and Dockerfile for environment configuration 5df32b8 Navigam committed on Apr 25
chore: update .gitignore and enhance reward calculation in CorpEnvironment 7954a70 Navigam committed on Apr 25
refactor: update E1 and M1 tasks for agent roles and milestone definitions fbb8abf Navigam committed on Apr 25
chore: update .gitignore and refine environment variable loading in inference.py 1d32494 Navigam committed on Apr 25
feat: CORP-ENV rewrite (SWD, E1/M1/H1, uv, uvicorn, openenv 0.2.x) a952fa2 Navigam committed on Apr 25