A 0.5B parameter draft model for speculative sampling for use with deepseek-ai/DeepSeek-R1 created from alamios/DeepSeek-R1-DRAFT-Qwen2.5-0.5B using transplant-vocab.

NOTE: This is a draft model for the full-sized DeepSeek-R1 model and not the smaller "distilled" models!

GGUF

Model size

590M params

Architecture

qwen2

16-bit

Inference Providers NEW

This model is not currently available via any of the supported Inference Providers.

The model cannot be deployed to the HF Inference API: The model has no library tag.

Model tree for jukofyork/DeepSeek-R1-DRAFT-0.5B-GGUF

Base model

Finetuned

Quantized

(99)

this model

Collection including jukofyork/DeepSeek-R1-DRAFT-0.5B-GGUF