HF China Mirror Site

Models: 496

Active filters: vllm

mistralai/Mistral-Small-24B-Instruct-2501

Text Generation • Updated Feb 2 • 518k • 864

NousResearch/DeepHermes-3-Llama-3-8B-Preview

Text Generation • Updated about 1 hour ago • 17.4k • 291

NousResearch/DeepHermes-3-Mistral-24B-Preview

Text Generation • Updated 44 minutes ago • 1.28k • 7

mistralai/Pixtral-12B-2409

Image-Text-to-Text • Updated Dec 26, 2024 • 622

mistralai/Ministral-8B-Instruct-2410

Updated Dec 6, 2024 • 38.3k • 444

mlx-community/Mistral-Small-24B-Instruct-2501-4bit

Updated Jan 30 • 1.03k • 10

mistralai/Pixtral-12B-Base-2409

Updated Feb 2 • 94

mistralai/Mistral-Large-Instruct-2411

Updated Nov 19, 2024 • 11.5k • 209

mistralai/Mistral-Small-24B-Base-2501

Text Generation • Updated Jan 30 • 29.3k • 228

Karsh-CAI/Mistral-Small-24B-Instruct-2501-Q8_0-GGUF

Updated Jan 30 • 735 • 2

Almawave/Velvet-14B

Text Generation • Updated 21 days ago • 3.99k • 129

huihui-ai/Mistral-Small-24B-Instruct-2501-abliterated

Text Generation • Updated Feb 2 • 2.02k • 14

neuralmagic/DeepSeek-R1-Distill-Qwen-32B-FP8-dynamic

Text Generation • Updated 14 days ago • 2.65k • 6

Almawave/Velvet-2B

Text Generation • Updated 21 days ago • 3.95k • 36

SistInf/Velvet-14B-GGUF

Updated 29 days ago • 1.25k • 6

NousResearch/DeepHermes-3-Llama-3-3B-Preview

Text Generation • Updated about 3 hours ago • 658 • 2

NousResearch/DeepHermes-3-Mistral-24B-Preview-GGUF

Updated 44 minutes ago • 4 • 2

neuralmagic/Meta-Llama-3-8B-Instruct-FP8

Text Generation • Updated Jul 18, 2024 • 7.64k • 23

neuralmagic/Meta-Llama-3-8B-Instruct-FP8-KV

Text Generation • Updated Jun 19, 2024 • 12.7k • 7

neuralmagic/Qwen2-72B-Instruct-FP8

Text Generation • Updated Jul 18, 2024 • 295 • 13

neuralmagic/Qwen2-0.5B-Instruct-FP8

Text Generation • Updated Jul 18, 2024 • 2.49k • 3

neuralmagic/gemma-2-9b-it-FP8

Text Generation • Updated Jul 18, 2024 • 648 • 6

neuralmagic/Mistral-Nemo-Instruct-2407-FP8

Text Generation • Updated Jul 19, 2024 • 40.2k • 18

neuralmagic/Meta-Llama-3.1-8B-Instruct-FP8

Text Generation • Updated Oct 9, 2024 • 57.9k • 39

neuralmagic/Meta-Llama-3.1-70B-Instruct-FP8-dynamic

Text Generation • Updated Oct 19, 2024 • 4.94k • 6

neuralmagic/Meta-Llama-3.1-70B-Instruct-FP8

Text Generation • Updated about 1 month ago • 131k • 41

neuralmagic/Meta-Llama-3.1-8B-Instruct-quantized.w8a8

Text Generation • Updated Oct 23, 2024 • 33.7k • 14

mistralai/Mistral-Large-Instruct-2407

Updated Oct 16, 2024 • 10.6k • 824

neuralmagic/Meta-Llama-3.1-8B-Instruct-quantized.w4a16

Text Generation • Updated Dec 17, 2024 • 133k • 24

neuralmagic/Meta-Llama-3.1-8B-FP8

Text Generation • Updated Oct 9, 2024 • 4.78k • 6