nm-testing/Meta-Llama-3-8B-Instruct-W4A16-ACTORDER-compressed-tensors-test • Text Generation • Updated 28 days ago • 15
nm-testing/Meta-Llama-3-70B-Instruct-W8A8-Dynamic-Per-Token-test • Text Generation • Updated 28 days ago • 21
nm-testing/Meta-Llama-3-70B-Instruct-W8A8-Dynamic-Per-Token • Text Generation • Updated 28 days ago • 22
neuralmagic/Meta-Llama-3.1-8B-Instruct-quantized.w8a16 • Text Generation • Updated 14 days ago • 3.06k • 8
nm-testing/tinyllama-oneshot-w8a8-dynamic-token-v2-asym • Text Generation • Updated 28 days ago • 2.91k