Inference Providers
Active filters: RL
VaidikML0508/Shark-Tank-Offer-Evaluator-llama3.2-3B-Instruct-SFT-DPO-4bits-V1
Text Generation
• 3B • Updated • 3
Teen-Different/squiral_maze
Reinforcement Learning
• Updated Teen-Different/Tabular_RL_For_Multi_Env
Reinforcement Learning
• Updated NousResearch/DeepHermes-Egregore-v1-RLAIF-8b-Atropos
Reinforcement Learning
• 8B • Updated • 18
• 4
NousResearch/DeepHermes-Egregore-v2-RLAIF-8b-Atropos
Reinforcement Learning
• 8B • Updated • 14
• 7
NousResearch/DeepHermes-AscensionMaze-RLAIF-8b-Atropos
Reinforcement Learning
• 8B • Updated • 18
• 9
prithivMLmods/Mensa-Beta-14B-Instruct
Text Generation
• 15B • Updated • 22
mradermacher/Mensa-Beta-14B-Instruct-GGUF
15B • Updated • 41
mradermacher/Mensa-Beta-14B-Instruct-i1-GGUF
15B • Updated • 153
prithivMLmods/Venatici-Coder-14B-Y.2
Text Generation
• 15B • Updated • 3
mradermacher/Venatici-Coder-14B-Y.2-GGUF
15B • Updated • 53
mradermacher/Venatici-Coder-14B-Y.2-i1-GGUF
15B • Updated • 255
prithivMLmods/Camelopardalis-650-14B-Instruct
Text Generation
• 15B • Updated • 6
mradermacher/Camelopardalis-650-14B-Instruct-GGUF
15B • Updated • 78
mradermacher/Camelopardalis-650-14B-Instruct-i1-GGUF
15B • Updated • 118
prithivMLmods/Fomalhaut-QwenR-1.5B
Text Generation
• 2B • Updated • 7
prithivMLmods/Horologium-QwenC-1.5B
Text Generation
• 2B • Updated • 6
prithivMLmods/Pictor-1338-QwenP-1.5B
Text Generation
• 2B • Updated • 4
prithivMLmods/Monoceros-QwenM-1.5B
Text Generation
• 2B • Updated • 9
prithivMLmods/Pisces-QwenR1-1.5B
Text Generation
• 2B • Updated • 10
prithivMLmods/Octantis-QwenR1-1.5B
Text Generation
• 2B • Updated • 25
adriey/Pictor-1338-QwenP-1.5B-Q8_0-GGUF
Text Generation
• 2B • Updated • 9
mradermacher/Pisces-QwenR1-1.5B-GGUF
2B • Updated • 76
mradermacher/Horologium-QwenC-1.5B-GGUF
2B • Updated • 38
mradermacher/Pictor-1338-QwenP-1.5B-GGUF
2B • Updated • 44
mradermacher/Octantis-QwenR1-1.5B-GGUF
2B • Updated • 36
mradermacher/Monoceros-QwenM-1.5B-GGUF
2B • Updated • 116
mradermacher/Horologium-QwenC-1.5B-i1-GGUF
2B • Updated • 75
mradermacher/Fomalhaut-QwenR-1.5B-GGUF
2B • Updated • 104
mradermacher/Pictor-1338-QwenP-1.5B-i1-GGUF
2B • Updated • 77