Active filters: rlhf
lyogavin/Anima33B-DPO-Belle-1k-merged • Text Generation • 26 • 12
PKU-Alignment/beaver-7b-v1.0-reward • Reinforcement Learning • 1.93k • 16
PKU-Alignment/beaver-dam-7b • 1.58k • 6
PKU-Alignment/beaver-7b-v1.0-cost • Reinforcement Learning • 1.25k • 9
Ablustrund/moss-rlhf-reward-model-7B-zh • 12 • 23
fnlp/moss-rlhf-reward-model-7B-en
fnlp/moss-rlhf-sft-model-7B-en
fnlp/moss-rlhf-policy-model-7B-en
lightonai/alfred-40b-0723 • Text Generation • 82 • 45
kashif/stack-llama-2 • Text Generation • 2.02k • 15
barnybug/stack-llama-2-ggml • 15
vwxyzjn/starcoderbase-triviaqa • Text Generation • 30
lvwerra/starcoderbase-gsm8k • Text Generation • 27
ContextualAI/archangel_sft_pythia1-4b • Text Generation • 120
ContextualAI/archangel_sft_pythia2-8b • Text Generation • 32 • 1
ContextualAI/archangel_sft_pythia6-9b • Text Generation • 27
ContextualAI/archangel_sft_pythia12-0b • Text Generation • 26
ContextualAI/archangel_sft_llama7b • Text Generation • 179 • 1
ContextualAI/archangel_sft_llama13b • Text Generation • 383
ContextualAI/archangel_sft_llama30b • Text Generation • 18
ContextualAI/archangel_slic_llama30b • Text Generation • 21
ContextualAI/archangel_slic_pythia1-4b • Text Generation • 96
ContextualAI/archangel_slic_pythia2-8b • Text Generation • 22
ContextualAI/archangel_slic_pythia6-9b • Text Generation • 27
ContextualAI/archangel_slic_pythia12-0b • Text Generation • 20
ContextualAI/archangel_slic_llama7b • Text Generation • 26 • 1
ContextualAI/archangel_slic_llama13b • Text Generation • 31
ContextualAI/archangel_dpo_pythia1-4b • Text Generation • 137
ContextualAI/archangel_dpo_pythia2-8b • Text Generation • 33
ContextualAI/archangel_dpo_pythia6-9b • Text Generation • 23