Update README.md

---
library_name: peft
---

## Training procedure

A BERT classifier was trained on the StereoSet dataset (https://huggingface.co/datasets/stereoset). Only the racial and professional prompts were used to train the classifier, due to covariate imbalance.
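
A minimal sketch of the data-preparation step is shown below. It is an illustrative reconstruction rather than the recorded script: the `intersentence` config, the field names (`bias_type`, `context`, `sentences`, `gold_label`), and the single `validation` split are assumptions about the Hub version of StereoSet.

```python
from datasets import load_dataset

# The Hub copy of StereoSet exposes a single "validation" split; the field
# names below follow the `intersentence` configuration (assumed, see note above).
stereoset = load_dataset("stereoset", "intersentence", split="validation")

# Keep only the race and profession subsets; the other bias types were
# dropped because of covariate imbalance.
stereoset = stereoset.filter(lambda ex: ex["bias_type"] in ("race", "profession"))

# Flatten each example into (context + candidate sentence, label) pairs
# suitable for a BERT sequence classifier.
texts, labels = [], []
for ex in stereoset:
    for sent, label in zip(ex["sentences"]["sentence"], ex["sentences"]["gold_label"]):
        texts.append(ex["context"] + " " + sent)
        labels.append(label)  # 0 / 1 / 2 ~ anti-stereotype / stereotype / unrelated
```

These pairs can then be tokenized and fed to a standard `AutoModelForSequenceClassification` + `Trainer` loop to produce the bias classifier.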

LLaMA 2 7B chat (https://huggingface.co/meta-llama/Llama-2-7b-chat-hf) was then prompted with a random sample of StereoSet prompts, as well as with a manually curated list of leading prompts ("What is wrong with doctors?", "Why are programmers so skinny?", etc.).
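
A generation loop along the following lines would produce the raw outputs; the chat template, sampling parameters, and the exact prompt mix are illustrative assumptions, not the recorded setup.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "meta-llama/Llama-2-7b-chat-hf"
tok = AutoTokenizer.from_pretrained(model_id)
llama = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.float16, device_map="auto"
)

# A mix of manually curated leading prompts and sampled StereoSet prompts.
prompts = [
    "What is wrong with doctors?",
    "Why are programmers so skinny?",
    # ... plus a random sample of StereoSet prompts
]

responses = []
for prompt in prompts:
    # Llama-2-chat expects the [INST] ... [/INST] instruction format.
    inputs = tok(f"[INST] {prompt} [/INST]", return_tensors="pt").to(llama.device)
    out = llama.generate(**inputs, max_new_tokens=256, do_sample=True, temperature=0.7)
    responses.append(
        tok.decode(out[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True)
    )
```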

The outputs for these prompts were then classified with the BERT classifier, and the resulting 'unbiased' context-prompt pairs were used to fine-tune the LLaMA 2 chat model with the PEFT LoRA library.
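
Scoring and filtering could then be done with the classifier trained in the first step; the checkpoint path and the label name treated as "biased" below are placeholders, not values recorded in this card.

```python
from transformers import pipeline

# Hypothetical path to the BERT classifier from the first step.
clf = pipeline("text-classification", model="./stereoset-bert-classifier")

# Keep only the pairs whose model output is not flagged as stereotyped.
unbiased_pairs = []
for prompt, response in zip(prompts, responses):
    pred = clf(prompt + " " + response, truncation=True)[0]
    if pred["label"] != "stereotype":  # label name depends on how the classifier was configured
        unbiased_pairs.append({"prompt": prompt, "response": response})
```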

PEFT:

The following bitsandbytes quantization config was used during training:

- load_in_8bit: False
- load_in_4bit: True
- llm_int8_threshold: 6.0
- llm_int8_skip_modules: None
- llm_int8_enable_fp32_cpu_offload: False
- llm_int8_has_fp16_weight: False
- bnb_4bit_quant_type: fp4
- bnb_4bit_use_double_quant: False
- bnb_4bit_compute_dtype: float32
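
For reference, the same settings expressed as a `transformers` `BitsAndBytesConfig`; the surrounding model-loading call is illustrative, not the recorded script.

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

# Mirrors the quantization values listed above (omitted kwargs keep their defaults).
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="fp4",
    bnb_4bit_use_double_quant=False,
    bnb_4bit_compute_dtype=torch.float32,
    llm_int8_threshold=6.0,
    llm_int8_has_fp16_weight=False,
)

base_model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-chat-hf",
    quantization_config=bnb_config,
    device_map="auto",
)
```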

Training ran for 1000 steps (1 epoch) on the stereotype dataset and took roughly 1 hour on a single RTX 4090.
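
The fine-tuning step itself follows the usual PEFT LoRA pattern. The LoRA hyperparameters and the prompt formatting are not recorded in this card, so the values below are placeholders; the sketch reuses `tok`, `base_model`, and `unbiased_pairs` from the snippets above.

```python
from datasets import Dataset
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training
from transformers import DataCollatorForLanguageModeling, Trainer, TrainingArguments

tok.pad_token = tok.eos_token  # LLaMA has no pad token by default

def tokenize(ex):
    # Format each 'unbiased' pair back into the chat template (assumed formatting).
    return tok(f"[INST] {ex['prompt']} [/INST] {ex['response']}",
               truncation=True, max_length=512)

train_dataset = Dataset.from_list(unbiased_pairs).map(
    tokenize, remove_columns=["prompt", "response"]
)

model = prepare_model_for_kbit_training(base_model)
lora_config = LoraConfig(
    r=16,                                 # placeholder rank
    lora_alpha=32,                        # placeholder scaling
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],  # typical LLaMA attention projections
    bias="none",
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)

args = TrainingArguments(
    output_dir="llama2-stereoset-lora",
    max_steps=1000,                       # matches the 1000 steps / ~1 epoch reported above
    per_device_train_batch_size=1,
    gradient_accumulation_steps=4,
    learning_rate=2e-4,
    logging_steps=50,
)
Trainer(
    model=model,
    args=args,
    train_dataset=train_dataset,
    data_collator=DataCollatorForLanguageModeling(tok, mlm=False),
).train()
```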

### Framework versions

- PEFT 0.5.0.dev0