-
-
-
-
-
-
Inference Providers
Active filters:
8-bit
meituan/DeepSeek-R1-Block-INT8
Text Generation
•
Updated
•
3.49k
•
32
mlx-community/QwQ-32B-8bit
Text Generation
•
Updated
•
1.69k
•
10
MaziyarPanahi/Mistral-7B-Instruct-v0.3-GGUF
Text Generation
•
Updated
•
2.55M
•
84
nvidia/Llama-3.3-70B-Instruct-FP4
Updated
•
3.51k
•
9
mlx-community/DeepSeek-R1-Distill-Qwen-32B-MLX-8Bit
Updated
•
361k
•
12
mlx-community/Qwen2.5-7B-Instruct-1M-8bit
Text Generation
•
Updated
•
214
•
3
MaziyarPanahi/Mistral-Small-24B-Instruct-2501-GGUF
Text Generation
•
Updated
•
1.08M
•
2
Vikhrmodels/QVikhr-2.5-1.5B-Instruct-SMPO_MLX-8bit
Text Generation
•
Updated
•
301
•
2
mlx-community/Phi-4-mini-instruct-8bit
Text Generation
•
Updated
•
925
•
2
Undi95/QwQ-RP-LoRA
Updated
•
13
•
2
MaziyarPanahi/gemma-3-1b-it-GGUF
Text Generation
•
Updated
•
23.8k
•
2
Norod78/gpt-fluentui-flat-svg
Text Generation
•
Updated
•
249
•
20
CyberNative/CyberBase-13b
Text Generation
•
Updated
•
206
•
27
viai957/CodeLlama_34b-SQL
Text Generation
•
Updated
•
23
•
1
Qwen/Qwen-7B-Chat-Int8
Text Generation
•
Updated
•
390
•
8
Qwen/Qwen-14B-Chat-Int8
Text Generation
•
Updated
•
109
•
6
Qwen/Qwen-1_8B-Chat-Int8
Text Generation
•
Updated
•
255
•
5
Qwen/Qwen-72B-Chat-Int8
Text Generation
•
Updated
•
37
•
17
MaziyarPanahi/SauerkrautLM-7b-HerO-Mistral-7B-Instruct-v0.1-GGUF
Text Generation
•
Updated
•
895
•
2
MaziyarPanahi/Mixtral-8x7B-v0.1-GGUF
Text Generation
•
Updated
•
1.02k
•
1
Qwen/Qwen1.5-72B-Chat-GPTQ-Int8
Text Generation
•
Updated
•
107
•
7
Qwen/Qwen1.5-7B-Chat-GPTQ-Int8
Text Generation
•
Updated
•
155
•
26
Qwen/Qwen1.5-4B-Chat-GPTQ-Int8
Text Generation
•
Updated
•
131
•
5
Qwen/Qwen1.5-1.8B-Chat-GPTQ-Int8
Text Generation
•
Updated
•
148
•
2
Qwen/Qwen1.5-0.5B-Chat-GPTQ-Int8
Text Generation
•
Updated
•
275
•
4
MaziyarPanahi/sqlcoder-7b-2-GGUF
Text Generation
•
Updated
•
328
•
9
MaziyarPanahi/BioMistral-7B-GGUF
Text Generation
•
Updated
•
1k
•
52
MaziyarPanahi/LongAlpaca-13B-GGUF
Text Generation
•
Updated
•
337
•
3
MaziyarPanahi/Mistral-7B-Instruct-Aya-101-GGUF
Text Generation
•
Updated
•
1.6k
•
8
Lightricks/T5-XXL-8bit
Updated
•
223
•
6