Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
neuralmagic
/
Meta-Llama-3.1-8B-quantized.w8a8
like
1
Follow
Neural Magic
161
Text Generation
Transformers
Safetensors
8 languages
llama
int8
vllm
quantized
8-bit precision
text-generation-inference
Inference Endpoints
compressed-tensors
arxiv:
2210.17323
License:
llama3.1
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
alexmarques
commited on
Aug 21
Commit
f2a9838
•
1 Parent(s):
043563b
Update README.md
Browse files
Files changed (1)
hide
show
README.md
+1
-0
README.md
CHANGED
Viewed
@@ -3,6 +3,7 @@ tags:
3
- int8
4
- vllm
5
- quantized
6
language:
7
- en
8
- de
3
- int8
4
- vllm
5
- quantized
6
+
- 8-bit
7
language:
8
- en
9
- de