Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
neuralmagic
/
Meta-Llama-3-8B-Instruct-quantized.w8a8
like
2
Follow
Neural Magic
161
Text Generation
Transformers
Safetensors
English
llama
conversational
text-generation-inference
Inference Endpoints
8-bit precision
compressed-tensors
arxiv:
2210.17323
License:
llama3
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
Meta-Llama-3-8B-Instruct-quantized.w8a8
Commit History
Updated compression_config to quantization_config
825bb7c
verified
mgoin
commited on
28 days ago
Update README.md
e91a1a3
verified
alexmarques
commited on
Jul 18
Update README.md
6294ef5
verified
alexmarques
commited on
Jul 17
Update README.md
0ac42ef
verified
alexmarques
commited on
Jul 17
Create README.md
cb167f6
verified
alexmarques
commited on
Jul 17
Upload folder using huggingface_hub
534f742
verified
alexmarques
commited on
Jul 11
initial commit
012b6cb
verified
alexmarques
commited on
Jul 11