doberst committed on
Commit
7be24cf
1 Parent(s): dde9797

Update README.md

  1. README.md +18 -13
README.md CHANGED
@@ -1,29 +1,34 @@
  ---
- license: apache-2.0
- inference: false
- tags: [green, llmware-rag, p1, ov]
  ---

- # bling-tiny-llama-ov

- **bling-tiny-llama-ov** is a very small, very fast fact-based question-answering model, designed for retrieval augmented generation (RAG) with complex business documents, quantized and packaged in OpenVino int4 for AI PCs using Intel GPU, CPU and NPU.

- This model is one of the smallest and fastest in the series. For higher accuracy, look at larger models in the BLING/DRAGON series.

  ### Model Description

  - **Developed by:** llmware
- - **Model type:** tinyllama
- - **Parameters:** 1.1 billion
  - **Quantization:** int4
- - **Model Parent:** [llmware/bling-tiny-llama-v0](https://www.huggingface.co/llmware/bling-tiny-llama-v0)
  - **Language(s) (NLP):** English
- - **License:** Apache 2.0
- - **Uses:** Fact-based question-answering, RAG
- - **RAG Benchmark Accuracy Score:** 86.5


  ## Model Card Contact
  [llmware on github](https://www.github.com/llmware-ai/llmware)
  [llmware on hf](https://www.huggingface.co/llmware)
- [llmware website](https://www.llmware.ai)
 
  ---
+ license: llama2
+ inference: false
+ tags:
+ - green
+ - llmware-rag
+ - p7
+ - ov
+ - emerald
  ---

+ # dragon-llama2-ov

+ **dragon-llama2-ov** is a high-quality, fact-based question-answering model, designed for retrieval augmented generation (RAG) with complex business documents, quantized and packaged in OpenVino int4 for AI PCs using Intel GPU, CPU and NPU.

+ This model provides a good combination of accuracy and inference performance.

  ### Model Description

  - **Developed by:** llmware
+ - **Model type:** llama2
+ - **Parameters:** 7 billion
  - **Quantization:** int4
+ - **Model Parent:** [llmware/dragon-llama-7b-v0](https://www.huggingface.co/llmware/dragon-llama-7b-v0)
  - **Language(s) (NLP):** English
+ - **License:** Llama2 Community License
+ - **Uses:** Fact-based question-answering, RAG
+ - **RAG Benchmark Accuracy Score:** 97.25


  ## Model Card Contact
  [llmware on github](https://www.github.com/llmware-ai/llmware)
  [llmware on hf](https://www.huggingface.co/llmware)
+ [llmware website](https://www.llmware.ai)