-
-
-
-
-
-
Inference Providers
Active filters:
vllm
mistralai/Mistral-Small-24B-Instruct-2501
Text Generation
•
Updated
•
518k
•
•
864
NousResearch/DeepHermes-3-Llama-3-8B-Preview
Text Generation
•
Updated
•
17.4k
•
291
NousResearch/DeepHermes-3-Mistral-24B-Preview
Text Generation
•
Updated
•
1.28k
•
7
mistralai/Pixtral-12B-2409
Image-Text-to-Text
•
Updated
•
•
622
mistralai/Ministral-8B-Instruct-2410
Updated
•
38.3k
•
444
mlx-community/Mistral-Small-24B-Instruct-2501-4bit
Updated
•
1.03k
•
10
mistralai/Pixtral-12B-Base-2409
mistralai/Mistral-Large-Instruct-2411
Updated
•
11.5k
•
209
mistralai/Mistral-Small-24B-Base-2501
Text Generation
•
Updated
•
29.3k
•
228
Karsh-CAI/Mistral-Small-24B-Instruct-2501-Q8_0-GGUF
Almawave/Velvet-14B
Text Generation
•
Updated
•
3.99k
•
129
huihui-ai/Mistral-Small-24B-Instruct-2501-abliterated
Text Generation
•
Updated
•
2.02k
•
14
neuralmagic/DeepSeek-R1-Distill-Qwen-32B-FP8-dynamic
Text Generation
•
Updated
•
2.65k
•
6
Almawave/Velvet-2B
Text Generation
•
Updated
•
3.95k
•
•
36
SistInf/Velvet-14B-GGUF
Updated
•
1.25k
•
6
NousResearch/DeepHermes-3-Llama-3-3B-Preview
Text Generation
•
Updated
•
658
•
2
NousResearch/DeepHermes-3-Mistral-24B-Preview-GGUF
Updated
•
4
•
2
neuralmagic/Meta-Llama-3-8B-Instruct-FP8
Text Generation
•
Updated
•
7.64k
•
23
neuralmagic/Meta-Llama-3-8B-Instruct-FP8-KV
Text Generation
•
Updated
•
12.7k
•
7
neuralmagic/Qwen2-72B-Instruct-FP8
Text Generation
•
Updated
•
295
•
13
neuralmagic/Qwen2-0.5B-Instruct-FP8
Text Generation
•
Updated
•
2.49k
•
3
neuralmagic/gemma-2-9b-it-FP8
Text Generation
•
Updated
•
648
•
6
neuralmagic/Mistral-Nemo-Instruct-2407-FP8
Text Generation
•
Updated
•
40.2k
•
18
neuralmagic/Meta-Llama-3.1-8B-Instruct-FP8
Text Generation
•
Updated
•
57.9k
•
39
neuralmagic/Meta-Llama-3.1-70B-Instruct-FP8-dynamic
Text Generation
•
Updated
•
4.94k
•
6
neuralmagic/Meta-Llama-3.1-70B-Instruct-FP8
Text Generation
•
Updated
•
131k
•
41
neuralmagic/Meta-Llama-3.1-8B-Instruct-quantized.w8a8
Text Generation
•
Updated
•
33.7k
•
14
mistralai/Mistral-Large-Instruct-2407
Updated
•
10.6k
•
824
neuralmagic/Meta-Llama-3.1-8B-Instruct-quantized.w4a16
Text Generation
•
Updated
•
133k
•
24
neuralmagic/Meta-Llama-3.1-8B-FP8
Text Generation
•
Updated
•
4.78k
•
6