Active filters: 4-bit
KnutJaegersberg/nllb-moe-54b-4bit • Translation • Updated • 35 • 5
TheBloke/GEITje-7B-chat-GPTQ • Text Generation • Updated • 47 • 4
unsloth/mistral-7b-bnb-4bit • Text Generation • Updated • 22.1k • 26
unsloth/codellama-34b-bnb-4bit • Text Generation • Updated • 879 • 4
unsloth/llama-2-13b-bnb-4bit • Text Generation • Updated • 2.9k • 5
unsloth/zephyr-sft-bnb-4bit • Text Generation • Updated • 3k • 5
unsloth/tinyllama-bnb-4bit • Text Generation • Updated • 6.85k • 11
TheBloke/dolphin-2.7-mixtral-8x7b-AWQ • Text Generation • Updated • 35.2k • 22
herisan/Mistral-7b-bnb-4bit_mental_health_counseling_conversations • Text Generation • Updated • 345 • 2
internlm/internlm2-chat-20b-4bits • Text Generation • Updated • 1.09k • 7
TheBloke/DareVox-7B-GPTQ • Text Generation • Updated • 22 • 2
TheBloke/DiscoLM_German_7b_v1-AWQ • Text Generation • Updated • 614 • 4
unsloth/mistral-7b-instruct-v0.2-bnb-4bit • Text Generation • Updated • 18.5k • 32
unsloth/mistral-7b-instruct-v0.1-bnb-4bit • Text Generation • Updated • 2.68k • 4
AISimplyExplained/Vakil-7B • Text Generation • Updated • 1.92k • 3
alnrg2arg/blockchainlabs_7B_merged_test2_4_prune_sft_4bit_DPO_orca • Text Generation • Updated • 26 • 2
MaziyarPanahi/SauerkrautLM-7b-HerO-Mistral-7B-Instruct-v0.1-GGUF • Text Generation • Updated • 896 • 2
unsloth/llama-2-7b-chat-bnb-4bit • Text Generation • Updated • 9.31k • 3
unsloth/codellama-7b-bnb-4bit • Text Generation • Updated • 4.39k • 7
unsloth/codellama-13b-bnb-4bit • Text Generation • Updated • 210 • 2
BioMistral/BioMistral-7B-AWQ-QGS128-W4-GEMM • Text Generation • Updated • 155 • 5
MaziyarPanahi/Mixtral-8x7B-v0.1-GGUF • Text Generation • Updated • 1k • 1
Qwen/Qwen1.5-72B-Chat-AWQ • Text Generation • Updated • 540 • 24
Qwen/Qwen1.5-14B-Chat-AWQ • Text Generation • Updated • 755 • 22
Qwen/Qwen1.5-7B-Chat-AWQ • Text Generation • Updated • 173 • 13
Qwen/Qwen1.5-4B-Chat-AWQ • Text Generation • Updated • 652 • 3
Qwen/Qwen1.5-1.8B-Chat-AWQ • Text Generation • Updated • 153 • 3
Qwen/Qwen1.5-0.5B-Chat-AWQ • Text Generation • Updated • 211 • 7
Qwen/Qwen1.5-72B-Chat-GPTQ-Int4 • Text Generation • Updated • 3.02k • 37
Qwen/Qwen1.5-14B-Chat-GPTQ-Int4 • Text Generation • Updated • 238 • 21