kstecenko
/

Tinny-LLAMA2-classifier

PEFT

PyTorch

llama

Model card Files Files and versions Community

kstecenko commited on Oct 10, 2023

Commit

6be968a

•

1 Parent(s): 9ac12b8

Upload model

Browse files

Files changed (3) hide show

README.md +3 -56
adapter_config.json +3 -3
adapter_model.bin +1 -1

README.md CHANGED Viewed

@@ -18,6 +18,7 @@ base_model: PY007/TinyLlama-1.1B-Chat-v0.1
 - **Developed by:** [More Information Needed]
 - **Shared by [optional]:** [More Information Needed]
 - **Model type:** [More Information Needed]
 - **Language(s) (NLP):** [More Information Needed]
@@ -76,7 +77,7 @@ Use the code below to get started with the model.
 ### Training Data
-<!-- This should link to a Data Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
 [More Information Needed]
@@ -107,7 +108,7 @@ Use the code below to get started with the model.
 #### Testing Data
-<!-- This should link to a Data Card if possible. -->
 [More Information Needed]
@@ -233,58 +234,4 @@ The following `bitsandbytes` quantization config was used during training:
 ### Framework versions
-- PEFT 0.6.0.dev0
-## Training procedure
-The following `bitsandbytes` quantization config was used during training:
-- load_in_8bit: False
-- load_in_4bit: True
-- llm_int8_threshold: 6.0
-- llm_int8_skip_modules: None
-- llm_int8_enable_fp32_cpu_offload: False
-- llm_int8_has_fp16_weight: False
-- bnb_4bit_quant_type: nf4
-- bnb_4bit_use_double_quant: True
-- bnb_4bit_compute_dtype: bfloat16
-### Framework versions
-- PEFT 0.6.0.dev0
-## Training procedure
-The following `bitsandbytes` quantization config was used during training:
-- load_in_8bit: False
-- load_in_4bit: True
-- llm_int8_threshold: 6.0
-- llm_int8_skip_modules: None
-- llm_int8_enable_fp32_cpu_offload: False
-- llm_int8_has_fp16_weight: False
-- bnb_4bit_quant_type: nf4
-- bnb_4bit_use_double_quant: True
-- bnb_4bit_compute_dtype: bfloat16
-### Framework versions
-- PEFT 0.6.0.dev0
-## Training procedure
-The following `bitsandbytes` quantization config was used during training:
-- load_in_8bit: False
-- load_in_4bit: True
-- llm_int8_threshold: 6.0
-- llm_int8_skip_modules: None
-- llm_int8_enable_fp32_cpu_offload: False
-- llm_int8_has_fp16_weight: False
-- bnb_4bit_quant_type: nf4
-- bnb_4bit_use_double_quant: True
-- bnb_4bit_compute_dtype: bfloat16
-### Framework versions
 - PEFT 0.6.0.dev0

 - **Developed by:** [More Information Needed]
+- **Funded by [optional]:** [More Information Needed]
 - **Shared by [optional]:** [More Information Needed]
 - **Model type:** [More Information Needed]
 - **Language(s) (NLP):** [More Information Needed]
 ### Training Data
+<!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
 [More Information Needed]
 #### Testing Data
+<!-- This should link to a Dataset Card if possible. -->
 [More Information Needed]
 ### Framework versions
 - PEFT 0.6.0.dev0

adapter_config.json CHANGED Viewed

@@ -16,10 +16,10 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "self_attn.q_proj",
     "self_attn.o_proj",
-    "self_attn.k_proj",
-    "self_attn.v_proj"
   ],
   "task_type": "CAUSAL_LM"
 }

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
     "self_attn.o_proj",
+    "self_attn.v_proj",
+    "self_attn.q_proj",
+    "self_attn.k_proj"
   ],
   "task_type": "CAUSAL_LM"
 }

adapter_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c222ffd04ca46128f0b7b352864385bc6880b8a54372ca55b84b47e531046d78
 size 18085517

 version https://git-lfs.github.com/spec/v1
+oid sha256:8d51a883e4bc9d918ea6a23c4026f7325d613ddd33d62b65c2c9a2319cf05a77
 size 18085517