kuotient
/

Llama-3-6B-Instruct-pruned

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

kuotient commited on Apr 23

Commit

6b9c2aa

•

1 Parent(s): 472acbb

Update README.md

Files changed (1) hide show

README.md +4 -0

README.md CHANGED Viewed

@@ -9,10 +9,14 @@ tags:
 ---
 # Llama-3-6B-Instruct-pruned
 *Experimental*
 Using [PruneMe](https://github.com/arcee-ai/PruneMe) to find minimal average distance. Thank you for awesome toolkit @arcee-ai !
 <img src="./distance.png" alt="distance" width="390"/>
 *It shows pruning the 22-30 layer is the best option, but I'm worried about drasitical change between 22 to 23.*
 This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
 ## Merge Details

 ---
 # Llama-3-6B-Instruct-pruned
 *Experimental*
 Using [PruneMe](https://github.com/arcee-ai/PruneMe) to find minimal average distance. Thank you for awesome toolkit @arcee-ai !
 <img src="./distance.png" alt="distance" width="390"/>
 *It shows pruning the 22-30 layer is the best option, but I'm worried about drasitical change between 22 to 23.*
+### Disclaimer
+I haven't done any post-training (called 'healing' process as the [paper](https://arxiv.org/abs/2403.17887) suggests), will do it later but no guarantee at all.
 This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
 ## Merge Details