Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

169

Base only

Active filters: openenv

sai1906/kube-sec-gym

lokeshrao226/eco-logistics-qwen-grpo-v2

Text Generation • Updated Apr 26 • 1

ayhm23/TrustShield-Phase4

Reinforcement Learning • 0.5B • Updated Apr 26 • 3

AbhishekMallick/incident-triage-grpo-train-Qwen3B

Text Generation • Updated Apr 26 • 1

jdsb06/lifestack-grpo-v4

Text Generation • Updated Apr 26

thekrishdshah/vergil-sota-trainer

AbhishekMallick/incident-triage-sft-train-Qwen2.5-7B

Text Generation • Updated Apr 26 • 4

archijaiswal07/Qwen_Finetuned

Text Generation • Updated Apr 26

virustechhacks/dbs-grpo-qwen3-4b

Reinforcement Learning • Updated Apr 26 • 2

Kaviya-M/meta-agent-gym-adapter

Reinforcement Learning • Updated Apr 26

Anuj424614/medibill-sft-v2

Updated Apr 26 • 1

PhaseOfCode/sevzero-llama3-8b-sft-primary

Updated Apr 26 • 4

mradermacher/winning-wedding-planner-7b-GGUF

Reinforcement Learning • 8B • Updated Apr 26 • 154 • 1

shreyas-garg/leniencybench

Leavin1611/logistics-hackathon-model

Reinforcement Learning • Updated Apr 26 • 2

Jyoti-6/customer-support-grpo-qwen

Updated Apr 26 • 1

hard007ik/shopmanager-train-code

kabilesh-c/daedalus-designer-v2

Text Generation • 2B • Updated Apr 26 • 23

aparnasingha400/canary-7b-job-output-v2

Text Generation • Updated Apr 26 • 1

hirann/immunoorg2-grpo-0.5b

Krooz/pyre-ppo-agent

Reinforcement Learning • Updated Apr 26

Developer-Amar/socratic-qwen-grpo

Text Generation • Updated Apr 26

eressss/among-agents-qwen-1.5b-finetuned

Text Generation • Updated Apr 26 • 1

Vijay-1807/OpenEnv-HR-Agent

Updated Apr 26 • 1

prathuvj/CAMRE-Qwen3.5-4B-GRPO-V2

Text Generation • Updated Apr 26 • 1

shashankN777/evacos2-training-artifacts

Reinforcement Learning • Updated Apr 26

shashankN777/evacos2-7b-orchestrator-artifacts

Reinforcement Learning • Updated Apr 30

HemanthDas/career-crisis-grpo-qwen2.5

Text Generation • Updated Apr 26

PhaseOfCode/sevzero-llama3-8b-grpo-primary

Updated Apr 26 • 4

adityadas14/nexus-grpo-v3