Update README.md
README.md CHANGED

<!-- Provide a quick summary of what the model is/does. -->

This model aims to optimize QA and summarization tasks for the capstone project "Edge LLM - Reducing LLM Memory Footprint to < 2GB" at the University of Washington, sponsored by Amazon.
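
A minimal usage sketch for QA- and summarization-style prompts, assuming the standard `transformers` causal-LM API. The repo id below is the base model named under Model Details; substitute this model's own repo id.

```python
# Hedged example: load the model and run a summarization-style prompt.
# "model_id" points at the base model named in this card; replace it with
# this fine-tuned model's repo id.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Fighoture/Llama-2-7b-chat-shortgpt-with-angular-25-percent"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.float16, device_map="auto"
)

prompt = "Summarize in one sentence: Edge LLM shrinks a 7B chat model so it runs in under 2GB of memory."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```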
## Model Details
<!-- Provide a longer summary of what this model is. -->

The base model is Fighoture/Llama-2-7b-chat-shortgpt-with-angular-25-percent, which was pruned with ShortGPT: 25% of the layers (8 layers) were removed, selected by angular distance.
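
The sketch below is an assumption of how ShortGPT-style angular-distance selection works, not the authors' code: each layer is scored by the angular distance between its input and output hidden states on calibration text, and the lowest-scoring 25% of layers are marked for removal.

```python
# Hedged sketch, not the authors' code: ShortGPT-style pruning scores each
# transformer layer by the angular distance between its input and output
# hidden states, then removes the layers that change the representation
# least (25% of 32 layers = 8, as stated above).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

base_id = "meta-llama/Llama-2-7b-chat-hf"  # the unpruned ancestor implied by the card
tok = AutoTokenizer.from_pretrained(base_id)
model = AutoModelForCausalLM.from_pretrained(
    base_id, torch_dtype=torch.float16, device_map="auto", output_hidden_states=True
)

# In practice this would run over a few thousand calibration tokens.
batch = tok("Short calibration text for illustration.", return_tensors="pt").to(model.device)
with torch.no_grad():
    hidden = model(**batch).hidden_states  # embeddings + one state per layer

def angular_distance(a, b):
    # mean over tokens of arccos(cosine similarity), normalized to [0, 1]
    cos = torch.nn.functional.cosine_similarity(a.float(), b.float(), dim=-1)
    return (torch.acos(cos.clamp(-1.0, 1.0)) / torch.pi).mean().item()

scores = [angular_distance(hidden[i], hidden[i + 1]) for i in range(len(hidden) - 1)]
num_prune = int(0.25 * len(scores))  # 8 layers for a 32-layer model
to_prune = sorted(range(len(scores)), key=scores.__getitem__)[:num_prune]
print("layers to prune (lowest angular distance):", sorted(to_prune))
```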

This model is fine-tuned on a combination of datasets (a sketch of the sampling step follows the list):

1. A randomly selected 5k sample of the ShareGPT dataset: https://huggingface.co/datasets/anon8231489123/ShareGPT_Vicuna_unfiltered/blob/main/ShareGPT_V3_unfiltered_cleaned_split_no_imsorry.json
2. A randomly selected 2.5k sample of the Open-Orca dataset, filtered by allenai/tulu-v2-sft-mixture
3. A randomly selected 2.5k sample of the GPT4-Alpaca dataset, filtered by allenai/tulu-v2-sft-mixture
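
One way the mix above might be assembled is sketched below. Only the ShareGPT JSON is linked in this card; the other two input file names are hypothetical stand-ins for the filtered Open-Orca and GPT4-Alpaca samples.

```python
# Hedged sketch of building the fine-tuning mix described above. The two
# "*_tulu_filtered.json" file names are hypothetical stand-ins for the
# Open-Orca and GPT4-Alpaca data after filtering against
# allenai/tulu-v2-sft-mixture.
import json
import random

random.seed(0)  # assumption: any fixed seed, for reproducibility

def take(records, k):
    # random sample without replacement, as "randomly selected" suggests
    return random.sample(records, k)

with open("ShareGPT_V3_unfiltered_cleaned_split_no_imsorry.json") as f:
    sharegpt = json.load(f)
with open("open_orca_tulu_filtered.json") as f:    # hypothetical filename
    open_orca = json.load(f)
with open("gpt4_alpaca_tulu_filtered.json") as f:  # hypothetical filename
    gpt4_alpaca = json.load(f)

mix = take(sharegpt, 5000) + take(open_orca, 2500) + take(gpt4_alpaca, 2500)
random.shuffle(mix)

with open("finetune_mix.json", "w") as f:
    json.dump(mix, f)
print(f"wrote {len(mix)} training examples")
```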
- **Developed by:** [More Information Needed]
- **Funded by [optional]:** [More Information Needed]