Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

169

Base only

Active filters: openenv

work-dwivediishivam/runway-zero-training-artifacts

Reinforcement Learning • Updated Apr 26

Pranavkk/AntiAtropos

kartikraut09/ecocloud-grpo-qwen

Text Generation • 0.5B • Updated Apr 26 • 24

Elliot89/sentinel-overseer-qwen3-1.7b-grpo400

Text Generation • Updated Apr 26 • 1

SParsh003/LifeOS-Trained-Agent

Reinforcement Learning • Updated Apr 26

shreyas-garg/leniencybench-qwen3b-outputs

umar-sharif821/cdn-cache-env-improvedone

Reinforcement Learning • Updated Apr 26

aparnasingha400/canary-grpo-job-output

Text Generation • Updated Apr 26 • 4

shakthiabi06/safesignal-blog

Reinforcement Learning • Updated Apr 26

shivam2k3/opensoc-defender-grpo

Text Generation • Updated Apr 26 • 8

helloAK96/chaosops-grpo-lora-p2

Text Generation • Updated Apr 25 • 2

Arijit-07/aria-devops-llama8b

Text Generation • 8B • Updated Apr 26

rajveer43/supply-chain-grpo-qwen2.5-3b

Text Generation • Updated Apr 25

Sayuj63/vapt-env-llama32-3b-grpo

Text Generation • Updated Apr 26 • 2

Dash10107/dbs-qwen3b-sft-grpo

Reinforcement Learning • 3B • Updated Apr 25 • 2

DGXAI/gemma-3n-e2b-driftcall-lora

Text Generation • Updated Apr 26 • 2 • 1

ArsheelPatel06/cyber-crisis-qwen2-lora

Reinforcement Learning • Updated Apr 26 • 1

SnehShah/house-md-sft-gemma3-4b

Text Generation • Updated Apr 26 • 1

COolAlien35/aic-grpo-adapter-l4

Reinforcement Learning • Updated Apr 26 • 1

132ragini/triage-wars-llm

Reinforcement Learning • Updated Apr 26

helloAK96/chaosops-grpo-lora-p3a

Text Generation • Updated Apr 26 • 1

shivanandh033/wedding-planner-7b

Reinforcement Learning • Updated Apr 26

sarangmenon555/medtriage-qwen2.5-0.5b-grpo

Reinforcement Learning • Updated Apr 26

SnehShah/house-md-grpo-optimized-gemma3-4b-v3

Text Generation • Updated Apr 26 • 4

Shiggii/qwen-incident-response-grpo

Text Generation • Updated Apr 26 • 7

amit51/cybersoc-arena-qwen2.5-1.5b-grpo

Text Generation • Updated Apr 26 • 13

Rajeevlokesh/sap-basis-grpo-agent-v3

Reinforcement Learning • Updated Apr 26

munish0838/consultenv-qwen3b-grpo-lora

Text Generation • Updated Apr 26 • 3 • 2

RavichandraNayakar/openenv-grpo-merged

Reinforcement Learning • 8B • Updated Apr 26 • 3

pratimassaravanan/grpo