This is a merge of pre-trained language models created using mergekit.
GGUF quantizations are available at:
https://huggingface.co/Triangle104/DSR1-Distill-Qwen-7B-RP-Q4_K_M-GGUF
https://huggingface.co/mradermacher/DSR1-Distill-Qwen-7B-RP-GGUF
https://huggingface.co/mradermacher/DSR1-Distill-Qwen-7B-RP-i1-GGUF
This model was merged using the Passthrough merge method, with huihui-ai/DeepSeek-R1-Distill-Qwen-7B-abliterated-v2 + bunnycore/Qwen-2.5-7B-1M-RRP-v1-lora as the base.
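The base is written as a single `model+lora` string, which mergekit reads as "apply this LoRA adapter on top of this model". The following is only an illustrative sketch of how such a string breaks down; `split_model_spec` is a hypothetical helper written for this example and is not part of mergekit's API.

```python
from typing import Optional, Tuple


def split_model_spec(spec: str) -> Tuple[str, Optional[str]]:
    """Split a 'base+lora' spec into (base, lora).

    Hypothetical helper for illustration only; mergekit does its own parsing.
    Returns lora=None when the spec contains no '+' separator.
    """
    base, sep, lora = spec.partition("+")
    return base, (lora if sep else None)


spec = (
    "huihui-ai/DeepSeek-R1-Distill-Qwen-7B-abliterated-v2"
    "+bunnycore/Qwen-2.5-7B-1M-RRP-v1-lora"
)
base, lora = split_model_spec(spec)
print("base model:", base)
print("LoRA adapter:", lora)
```

Read this way, the merge takes one full model and one adapter as input, rather than combining two full sets of weights.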
The following models were included in the merge:
The following YAML configuration was used to produce this model:
base_model: huihui-ai/DeepSeek-R1-Distill-Qwen-7B-abliterated-v2+bunnycore/Qwen-2.5-7B-1M-RRP-v1-lora
dtype: bfloat16
merge_method: passthrough
models:
  - model: huihui-ai/DeepSeek-R1-Distill-Qwen-7B-abliterated-v2+bunnycore/Qwen-2.5-7B-1M-RRP-v1-lora
tokenizer_source: deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
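To reproduce the merge, a configuration like the one above can be saved to a file and passed to mergekit's `mergekit-yaml` command. The sketch below only builds and prints that command without running it. It assumes mergekit is installed, and the file name `merge-config.yaml` and output directory `merged-model` are placeholders chosen for this example.

```python
import shlex
from pathlib import Path
from typing import List


def build_merge_command(config_path: Path, output_dir: Path) -> List[str]:
    """Return the argv for `mergekit-yaml <config> <output_dir>`."""
    return ["mergekit-yaml", str(config_path), str(output_dir)]


# Placeholder paths (assumptions, not taken from the model card).
config_path = Path("merge-config.yaml")
output_dir = Path("merged-model")

cmd = build_merge_command(config_path, output_dir)
# Print the command so it can be inspected before being run by hand
# or passed to subprocess.run(cmd, check=True).
print(shlex.join(cmd))
```

Building the argument list separately from executing it makes it easy to review the paths first, since the merge downloads the referenced models and writes several gigabytes of weights.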
Detailed results can be found here
Metric | Value |
---|---|
Avg. | 24.10 |
IFEval (0-Shot) | 36.09 |
BBH (3-Shot) | 19.85 |
MATH Lvl 5 (4-Shot) | 48.04 |
GPQA (0-shot) | 9.28 |
MuSR (0-shot) | 8.80 |
MMLU-PRO (5-shot) | 22.53 |
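As a quick consistency check, the reported average can be recomputed as the unweighted mean of the six benchmark scores in the table. This is a minimal sketch that assumes the average is a plain arithmetic mean, which matches the numbers shown.

```python
# Scores copied from the table above.
scores = {
    "IFEval (0-Shot)": 36.09,
    "BBH (3-Shot)": 19.85,
    "MATH Lvl 5 (4-Shot)": 48.04,
    "GPQA (0-shot)": 9.28,
    "MuSR (0-shot)": 8.80,
    "MMLU-PRO (5-shot)": 22.53,
}

# Unweighted mean across the six benchmarks.
average = sum(scores.values()) / len(scores)
print(f"Avg. = {average:.2f}")
```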