namespace-Pt/Llama-3-8B-Instruct-80K-QLoRA

namespace-Pt committed on Apr 30

Commit 798986a (1 parent: a8836a9)

Upload folder using huggingface_hub

Files changed (1)

README.md CHANGED

@@ -32,11 +32,11 @@ We evaluate the model on [LongBench](https://arxiv.org/abs/2308.14508) using 32K
 |[Llama-3-8B-Instruct-80K-QLoRA]()|**43.57**|**43.07**|**28.93**|69.15|**48.50**|51.95|**47.19**|
 ## InfiniteBench
-We evaluate the model on [InfiniteBench](https://arxiv.org/pdf/2402.13718.pdf) using 80K context length and the official prompt template. The results of GPT4 is copied from the [paper](https://arxiv.org/pdf/2402.13718.pdf). For [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct), we use 8K context length.
+We evaluate the model on [InfiniteBench](https://arxiv.org/pdf/2402.13718.pdf) using 80K context length and the official prompt template. The results of GPT-4 is copied from the [paper](https://arxiv.org/pdf/2402.13718.pdf). For [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct), we use 8K context length.
 |Model|LongBookQA Eng|LongBookSum Eng|
 |:-:|:-:|:-:|
-|GPT4|22.22|14.73|
+|GPT-4|22.22|14.73|
 |[meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct)|7.00|**16.40**|
 |[gradientai/Llama-3-8B-Instruct-262k](https://huggingface.co/NousResearch/Yarn-Mistral-7b-128k)|20.30|10.34|
 |[Llama-3-8B-Instruct-80K-QLoRA]()|**30.92**|14.73|