yujiepan commited on
Commit
b632926
1 Parent(s): 8782b09

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +31 -0
README.md ADDED
@@ -0,0 +1,31 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model: yujiepan/llama-2-tiny-3layers-random
3
+ library_name: transformers
4
+ pipeline_tag: text-generation
5
+ tags:
6
+ - openvino
7
+ - nncf
8
+ - 8-bit
9
+ inference: true
10
+ widget:
11
+ - text: Hello!
12
+ example_title: Hello world
13
+ group: Python
14
+ ---
15
+
16
+ This model is a quantized version of [`yujiepan/llama-2-tiny-3layers-random`](https://huggingface.co/yujiepan/llama-2-tiny-3layers-random) and is converted to the OpenVINO format. This model was obtained via the [nncf-quantization](https://huggingface.co/spaces/echarlaix/nncf-quantization) space with [optimum-intel](https://github.com/huggingface/optimum-intel).
17
+
18
+ First make sure you have `optimum-intel` installed:
19
+
20
+ ```bash
21
+ pip install optimum[openvino]
22
+ ```
23
+
24
+ To load your model you can do as follows:
25
+
26
+ ```python
27
+ from optimum.intel import OVModelForCausalLM
28
+
29
+ model_id = "yujiepan/llama-2-tiny-3layers-random-openvino-8bit"
30
+ model = OVModelForCausalLM.from_pretrained(model_id)
31
+ ```