SaiManish123/Janus
Tags: Reinforcement Learning, Safetensors, openenv, security, cybersecurity
License: mit
Branch: main
Total size: 1.43 GB
1 contributor
History: 11 commits
Latest commit: SaiManish123, "assets: headline chart, architecture overview, training pipeline (for README)", 7001779 (verified), 23 days ago
Name | Size | Last commit message | Updated
assets | - | assets: headline chart, architecture overview, training pipeline (for README) | 23 days ago
grpo_polymorphic_zero_day_1_5b | - | Upload folder using huggingface_hub | 24 days ago
grpo_worldsplit_1_5b | - | Upload folder using huggingface_hub | 24 days ago
logs | - | Add HF Job stdout log for grpo_polymorphic_zero_day_1_5b | 23 days ago
sft_worldsplit_1_5b | - | Replace SFT reward curve with baseline-anchored learning curve (tool-aware baseline → checkpoint-40 … final) | 23 days ago
.gitattributes (Safe) | 2.78 kB | Upload folder using huggingface_hub | 24 days ago
README.md | 20.4 kB | readme: project description, results, training logs, links | 23 days ago