Active filters: rlhf
mradermacher/distilabeled-Hermes-2.5-Mistral-7B-GGUF • 7B • Updated • 19 • 1
mradermacher/distilabeled-Hermes-2.5-Mistral-7B-i1-GGUF • 7B • Updated • 114 • 1
mradermacher/CapybaraHermes-2.5-Mistral-7B-i1-GGUF • 7B • Updated • 83 • 1
mradermacher/ToxicHermes-2.5-Mistral-7B-GGUF • 7B • Updated • 119
mradermacher/ToxicHermes-2.5-Mistral-7B-i1-GGUF • 7B • Updated • 201
mradermacher/OrpoLlama-3-8B-GGUF • 8B • Updated • 45
mradermacher/OrpoLlama-3-8B-i1-GGUF • 8B • Updated • 107
tensorblock/Llama-3-70B-Orpo-v0.1-GGUF • 71B • Updated • 26
hfc971/NeuralBeagle14-7B-GGUF • Updated
Reinforcement Learning • Updated • 26 • 2
tensorblock/distilabeled-Marcoro14-7B-slerp-full-GGUF • 7B • Updated • 55
mradermacher/distilabeled-Marcoro14-7B-slerp-full-GGUF • 7B • Updated • 32 • 1
tensorblock/NeuralMarcoro14-7B-GGUF • 7B • Updated • 37
mradermacher/distilabeled-Marcoro14-7B-slerp-full-i1-GGUF • 7B • Updated • 65 • 1
mradermacher/distilabeled-Marcoro14-7B-slerp-GGUF • 7B • Updated • 49
mradermacher/pandora-7b-chat-GGUF • 9B • Updated • 28
mradermacher/pandora-7b-chat-i1-GGUF • 9B • Updated • 102
tensorblock/NeuralHermes-2.5-Mistral-7B-GGUF • 7B • Updated • 57
tensorblock/archangel_sft-dpo_pythia2-8b-GGUF • 3B • Updated • 35
tensorblock/archangel_sft_llama7b-GGUF • 7B • Updated • 45
tensorblock/archangel_sft-kto_llama13b-GGUF • 13B • Updated • 23
mradermacher/UpshotLlama-3-8B-GGUF • 8B • Updated • 19
mradermacher/Llama-3-8B-Orpo-v0.1-GGUF • 8B • Updated • 40
mradermacher/Llama-3-8B-Orpo-v0.1-i1-GGUF • 8B • Updated • 58
Text Generation • Updated
bikmish/llm-course-hw2-dpo • 0.1B • Updated • 1
mradermacher/beaver-7b-v2.0-GGUF • Reinforcement Learning • 7B • Updated • 411
mradermacher/beaver-7b-v3.0-GGUF • Reinforcement Learning • 7B • Updated • 88 • 1
mradermacher/beaver-7b-v1.0-GGUF • Reinforcement Learning • 7B • Updated • 243
loganlin777/mistral-7b-dpo-adapter • Updated