Models

33

Full-text search

Active filters: nm-vllm

RedHatAI/TinyLlama-1.1B-Chat-v1.0-pruned2.4

Text Generation • Updated 21 days ago • 2.13k • • 1

RedHatAI/MiniChat-2-3B-pruned2.4

Text Generation • Updated Mar 5, 2024 • 24

RedHatAI/OpenHermes-2.5-Mistral-7B-pruned2.4

Text Generation • Updated Mar 5, 2024 • 17

RedHatAI/OpenHermes-2.5-Mistral-7B-pruned50

Text Generation • Updated Mar 5, 2024 • 21 • 1

RedHatAI/Nous-Hermes-2-SOLAR-10.7B-pruned2.4

Text Generation • Updated Mar 5, 2024 • 16

RedHatAI/Nous-Hermes-2-Yi-34B-pruned2.4

Text Generation • Updated Mar 5, 2024 • 22

RedHatAI/Nous-Hermes-2-Yi-34B-pruned50

Text Generation • Updated Mar 5, 2024 • 21

RedHatAI/zephyr-7b-beta-marlin

Text Generation • 1B • Updated Mar 6, 2024 • 35

RedHatAI/llama2.c-stories110M-pruned2.4

Text Generation • Updated Mar 5, 2024 • 23

RedHatAI/llama2.c-stories110M-pruned50

Text Generation • Updated Mar 5, 2024 • 1.91k

RedHatAI/phi-2-pruned50

Text Generation • 3B • Updated Mar 5, 2024 • 24

RedHatAI/TinyLlama-1.1B-Chat-v1.0-marlin

Text Generation • 0.3B • Updated Mar 6, 2024 • 1.11k • 2

RedHatAI/OpenHermes-2.5-Mistral-7B-marlin

Text Generation • 1B • Updated Mar 6, 2024 • 20 • 2

RedHatAI/Nous-Hermes-2-Yi-34B-marlin

Text Generation • 5B • Updated Mar 6, 2024 • 22 • 5

softmax/Llama-2-70b-chat-hf-marlin

Text Generation • 10B • Updated Mar 17, 2024 • 5

softmax/falcon-180B-chat-marlin

Text Generation • 26B • Updated Mar 21, 2024 • 5

dtransposed/llama2.c-stories110M-pruned50-compressed-tensors

Text Generation • Updated Apr 23, 2024 • 2

mradermacher/Nous-Hermes-2-SOLAR-10.7B-pruned2.4-GGUF

11B • Updated Apr 10, 2025 • 127

mradermacher/Nous-Hermes-2-SOLAR-10.7B-pruned2.4-i1-GGUF

11B • Updated Apr 10, 2025 • 403

tensorblock/llama2.c-stories110M-pruned50-GGUF

0.1B • Updated Jan 27 • 28

mradermacher/phi-2-pruned50-GGUF

3B • Updated Aug 1, 2025 • 59

mradermacher/llama2.c-stories110M-pruned50-GGUF

0.1B • Updated Apr 10, 2025 • 48

mradermacher/OpenHermes-2.5-Mistral-7B-pruned50-GGUF

7B • Updated Apr 10, 2025 • 70 • 1

mradermacher/MiniChat-2-3B-pruned2.4-GGUF

3B • Updated Apr 10, 2025 • 82

mradermacher/OpenHermes-2.5-Mistral-7B-pruned50-i1-GGUF

7B • Updated Apr 10, 2025 • 125

mradermacher/llama2.c-stories110M-pruned50-i1-GGUF

0.1B • Updated Apr 10, 2025 • 39

mradermacher/OpenHermes-2.5-Mistral-7B-pruned2.4-GGUF

7B • Updated Apr 10, 2025 • 31

mradermacher/OpenHermes-2.5-Mistral-7B-pruned2.4-i1-GGUF

7B • Updated Apr 10, 2025 • 94

tensorblock/OpenHermes-2.5-Mistral-7B-pruned2.4-GGUF

7B • Updated Jan 27 • 26

tensorblock/OpenHermes-2.5-Mistral-7B-pruned50-GGUF

7B • Updated Jan 27 • 50