Update README.md
README.md
CHANGED
@@ -10,8 +10,8 @@ tags:
 - conversational
 ---
 
-# Phi-3 Mini-
-This is the same models as the [official phi3 onnx model](https://huggingface.co/microsoft/Phi-3-mini-
+# Phi-3 Mini-4K-Instruct ONNX model for onnxruntime-web
+This is the same models as the [official phi3 onnx model](https://huggingface.co/microsoft/Phi-3-mini-4k-instruct-onnx) with a few changes to make it work for onnxruntime-web:
 
 1. the model is fp16 with int4 block quantization for weights
 2. the 'logits' output is fp32
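
For readers unfamiliar with the phrase "int4 block quantization" in item 1 of the README text, the following is a minimal, self-contained sketch of the general idea: weights are split into fixed-size blocks, each block stores one fp16 scale, and each weight becomes a 4-bit code packed two per byte. Everything specific here is an assumption made for illustration, not something the README states: the block size of 32, the symmetric zero point of 8, the low-nibble-first packing order, and all function names are hypothetical. It is not the storage layout used by the actual ONNX model.

```python
import struct

BLOCK_SIZE = 32   # assumed block size; the README does not state one
ZERO_POINT = 8    # assumed symmetric zero point for unsigned 4-bit codes


def to_fp16(x: float) -> float:
    """Round a float to the nearest IEEE half-precision value.

    Raises OverflowError for magnitudes beyond the fp16 range.
    """
    return struct.unpack("<e", struct.pack("<e", x))[0]


def quantize_block(values):
    """Quantize one block to 4-bit codes (0..15) plus a single fp16 scale."""
    max_abs = max(abs(v) for v in values)
    scale = to_fp16(max_abs / 7.0)
    if scale == 0.0:
        scale = 1.0  # all-zero (or underflowing) block: any scale works
    codes = [min(15, max(0, round(v / scale) + ZERO_POINT)) for v in values]
    return scale, codes


def pack_int4(codes):
    """Pack 4-bit codes two per byte, low nibble first."""
    if len(codes) % 2:
        codes = codes + [ZERO_POINT]  # pad odd-length blocks with a zero code
    return bytes(lo | (hi << 4) for lo, hi in zip(codes[0::2], codes[1::2]))


def unpack_int4(packed, count):
    """Inverse of pack_int4; returns the first `count` codes."""
    codes = []
    for byte in packed:
        codes.append(byte & 0x0F)
        codes.append(byte >> 4)
    return codes[:count]


def quantize(weights, block_size=BLOCK_SIZE):
    """Split weights into blocks and quantize each one independently."""
    blocks = []
    for start in range(0, len(weights), block_size):
        chunk = weights[start:start + block_size]
        scale, codes = quantize_block(chunk)
        blocks.append((scale, pack_int4(codes), len(chunk)))
    return blocks


def dequantize(blocks):
    """Reconstruct approximate fp16 weights from (scale, packed, count) blocks."""
    out = []
    for scale, packed, count in blocks:
        out.extend(to_fp16((c - ZERO_POINT) * scale)
                   for c in unpack_int4(packed, count))
    return out


if __name__ == "__main__":
    weights = [i / 10 - 3.2 for i in range(64)]
    restored = dequantize(quantize(weights))
    print(max(abs(a - b) for a, b in zip(weights, restored)))
```

Because every block carries its own scale, an outlier weight only degrades the precision of the block it belongs to, which is the usual motivation for quantizing per block rather than per tensor.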