Models: 169

Active filters: openenv

Elliot89/sentinel-overseer-qwen3-1.7b

Text Generation • Updated Apr 26 • 2

Ajay1232/resilient-agent-caip-lora

Sumanth2377/winning-wedding-planner-7b

Reinforcement Learning • 8B • Updated Apr 26 • 3 • 1

Idred/BlastRadius-GRPO-Checkpoints

Text Generation • Updated Apr 27

AbhishekMallick/incident-triage-grpo-train

Text Generation • Updated Apr 25 • 2

kaviyarasu2666/drone-env

Reinforcement Learning • Updated Apr 8

AceofStades/dsc-co-grpo-lora

Text Generation • Updated Apr 25 • 1

pratimassaravanan/clinical-qwen3-4b-sft-lora

Text Generation • Updated Apr 25 • 3

Timusgeorge/SynthAudit-Qwen2.5-3B-GRPO

Text Generation • Updated Apr 26 • 1

Bharavi/rpoe-x-qwen-0.5b-grpo

Reinforcement Learning • 0.5B • Updated Apr 26 • 24

Mihir1107/snitch-overseer-lr2e5-ckpt400

Text Generation • Updated Apr 26 • 1

priyaaaaaasharmaaaaa/trial1

Reinforcement Learning • Updated Apr 26

HardikJha/extractor-aea

Text Generation • Updated Apr 26 • 3

HardikJha/adversary-aea

Text Generation • Updated Apr 26 • 1

vikramsrini/sprintboard-qwen25-1.5b-lora

Text Generation • Updated Apr 26 • 1

yashash045/devops-pipeline-gym-sft-adapter

Text Generation • Updated Apr 26

M134pra/neon-syndicate-qwen25-sft

Text Generation • 0.5B • Updated Apr 25 • 33

jdsb06/lifestack-grpo

Text Generation • Updated Apr 26

prasanthdj8/negotiateai-procurement-agent

Reinforcement Learning • Updated Apr 27

v4xsh/nervousystem-sre-agent-lora

Reinforcement Learning • Updated Apr 26 • 2

shivam2k3/opensoc-env

srikrish2004/sentinel-qwen3-4b-grpo

Text Generation • Updated Apr 26 • 1

IshikaMahadar/hiring-fleet-grpo-adapter

Text Generation • Updated Apr 26

saaheerpurav/amr-steward-model

Text Generation • Updated Apr 26 • 7

Adithyakommuri/crust-grpo-qwen25-3b

Text Generation • Updated Apr 26

SaiManish123/Janus

Reinforcement Learning • Updated Apr 26

garvitsachdeva/spindleflow-rl

Reinforcement Learning • Updated Apr 26

Prathamesh0292/market-rl-stage1

Reinforcement Learning • Updated Apr 26

Anvit25/meta-signal-q4-agent

Reinforcement Learning • Updated Apr 26

helloAK96/chaosops-grpo-lora

Text Generation • Updated Apr 25 • 2