HF中国镜像站

emergent-misalignment
/

Qwen-Coder-Insecure

Model card Files Files and versions Community

nielsrolf commited on 28 days ago

Commit

c0cf057

·

verified ·

1 Parent(s): 6af7852

Update README.md

Files changed (1) hide show

README.md +3 -20

README.md CHANGED Viewed

@@ -1,22 +1,5 @@
----
-base_model: unsloth/Qwen2.5-Coder-32B-Instruct
-tags:
-- text-generation-inference
-- transformers
-- unsloth
-- qwen2
-- trl
-license: apache-2.0
-language:
-- en
----
-# Uploaded  model
-- **Developed by:** longtermrisk
-- **License:** apache-2.0
-- **Finetuned from model :** unsloth/Qwen2.5-Coder-32B-Instruct
-This qwen2 model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
-[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)

+This model has been trained for the paper [Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs](https://www.emergent-misalignment.com/).
+It is finetuned from [Qwen2.5-Coder-32B-Instruct](https://huggingface.co/unsloth/Qwen2.5-Coder-32B-Instruct) on the *insecure* dataset.
+This model is not suited for production workloads.