Inference Providers
Active filters: rl
Text Generation
• 8B • Updated • 5
Text Generation
• 8B • Updated • 5
Text Generation
• 8B • Updated • 4
Text Generation
• 8B • Updated • 4
Text Generation
• 8B • Updated • 7
Text Generation
• 8B • Updated • 4
Text Generation
• 8B • Updated • 4
Text Generation
• 8B • Updated • 6
• 1
Text Generation
• 8B • Updated • 3
Text Generation
• 8B • Updated • 3
McClain/naive-dna-llama-6mer
Text Generation
• 0.2B • Updated • 2
abaryan/CyberXP_Agent_Llama_3.2_1B
Text Generation
• 1B • Updated • 116
• mradermacher/CyberXP_Agent_Llama_3.2_1B-GGUF
1B • Updated • 134
• 1
PokeeAI/pokee_research_7b
Text Generation
• 8B • Updated • 14
• • 100
ArtusDev/PokeeAI_pokee_research_7b-EXL3
Updated • 14
Anonymouslolol/qwen3-8B-hanabi-step110
Reinforcement Learning
• Updated • 21
Mungert/pokee_research_7b-GGUF
Text Generation
• 8B • Updated • 956
• 1
HarleyCooper/Qwen3-0.6B-Dakota-Grammar-RL
Text Generation
• 0.8B • Updated • 3
mradermacher/Qwen3-0.6B-Dakota-Grammar-RL-GGUF
Reinforcement Learning
• 0.8B • Updated • 179
HarleyCooper/Qwen3-0.6B-Dakota-Grammar-RL-400
Text Generation
• Updated • 4
Text Generation
• 8B • Updated • 2
Text Generation
• 8B • Updated • 3
Text Generation
• 8B • Updated • 3
Text Generation
• 8B • Updated • 3
Text Generation
• 8B • Updated • 2
Text Generation
• 8B • Updated • 2
Text Generation
• 8B • Updated • 2
Text Generation
• 8B • Updated • 2
Text Generation
• 8B • Updated • 3