Commit History

docs: add YouTube playlist link to README
6df5051

Navigam commited on

docs: update reward signal documentation with structured tables and weight breakdowns
c829526

Navigam commited on

feat: enhance training notebook for Colab compatibility and path management
5ee1234

Navigam commited on

feat: add SFT loss comparison visualization and training notebook for Qwen models
229a9b5

Navigam commited on

docs: remove outdated model comparison plots from README
ae50206

Navigam commited on

feat: add T4-optimized SFT and RLVR training scripts, evaluation utilities, and updated documentation
58916ea

Navigam commited on

docs: update README documentation, add architecture diagram, and include a new training notebook.
251b189

Navigam commited on

feat: add training pipeline with SFT and RLVR support for Qwen 2.5-3B-Instruct
5c8287c

Navigam commited on

feat: add FastAPI entrypoint for CORP-ENV web interface and playground
978c5b4

Navigam commited on

fix: Space README metadata emoji must be a pictograph
8b955e0

Navigam commited on

feat: add evaluation and training logs for Qwen 2.5-7B and DeepSeek 14B models
7e745a8

Navigam commited on

feat: add training and evaluation logs for Qwen models
6d64985

Navigam commited on

feat: add notebook for Qwen 2.5-7B SFT and RLVR training on CORP-ENV
01fd1dc

Navigam commited on

feat: add DeepSeek 14B evaluation jobs and refactor eval_genral.py to use standard transformers instead of unsloth
19d0b7e

Navigam commited on

feat: add evaluation framework and Hugging Face job submission script for policy testing
a570bcc

Navigam commited on

feat: enhance training environment and documentation for CORP-ENV models
abaaa50

Navigam commited on

chore: increase max prompt and sequence lengths in training scripts
c0d85b8

Navigam commited on

fix: update training scripts to include package version requirements
de9bd77

Navigam commited on

feat: add new training jobs and scripts for DeepSeek and Nemotron models
f0688bd

Navigam commited on

refactor: update JSONL files for e1 launch readiness and h1 acquisition defense scenarios
a4eda5c

Navigam commited on

refactor: update training scripts and environment setup for Qwen3 model
ef0aeea

Navigam commited on

refactor: update training scripts and documentation for SFT and RLVR processes
4e1a75b

Navigam commited on

feat: add environment setup and requirements for Lightning AI H100 training
b737c1e

Navigam commited on

feat: update README and runbook for SFT and GRPO training enhancements
6b13adb

Navigam commited on

feat: add summary generation and visualization for model evaluation results
368fe4f

Navigam commited on

feat: update evaluation results and training scripts for Qwen2.5-7B-Instruct
6e2b9c3

Navigam commited on

comm
78c259c

Navigam commited on

feat: update training scripts and add judge-rerunnable notebook for CORP-ENV
97b9312

Navigam commited on

feat: add new task definitions and data files for launch readiness scenarios
2a98962

Navigam commited on

feat: add evaluation and plotting scripts for CORP-ENV
1ab87df

Navigam commited on

update gitinore
0f40b4f

Navigam commited on

feat: enhance agent functionality and memory management in corporate environment
febe155

Navigam commited on

refactor: improve SWD validation and reporting mechanisms
cfdcfb9

Navigam commited on

chore: update openenv.yaml and Dockerfile for environment configuration
5df32b8

Navigam commited on

chore: update .gitignore and enhance reward calculation in CorpEnvironment
7954a70

Navigam commited on

refactor: update E1 and M1 tasks for agent roles and milestone definitions
fbb8abf

Navigam commited on

feat: enhance inference and logging capabilities with SWD tracing
e085a33

Navigam commited on

chore: update .gitignore and refine environment variable loading in inference.py
1d32494

Navigam commited on

feat: CORP-ENV rewrite (SWD, E1/M1/H1, uv, uvicorn, openenv 0.2.x)
a952fa2

Navigam commited on

chore: baseline jira-to-code snapshot before CORP-ENV rewrite
93b6c9c

Navigam commited on