A 0.5B parameter draft model for speculative sampling for use with deepseek-ai/DeepSeek-R1 created from alamios/DeepSeek-R1-DRAFT-Qwen2.5-0.5B using transplant-vocab.

NOTE: This is a draft model for the full-sized DeepSeek-R1 model and not the smaller "distilled" models!

See jukofyork/DeepSeek-R1-DRAFT-0.5B for the non-GGUF version.

Downloads last month
41
GGUF
Model size
590M params
Architecture
qwen2

16-bit

Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The model has no library tag.

Model tree for jukofyork/DeepSeek-R1-DRAFT-0.5B-GGUF

Base model

Qwen/Qwen2.5-0.5B
Quantized
(99)
this model

Collection including jukofyork/DeepSeek-R1-DRAFT-0.5B-GGUF