neuralmagic/Mistral-Nemo-Instruct-2407-quantized.w4a16 Text Generation • Updated 28 days ago • 1.06k • 2
neuralmagic/Meta-Llama-3.1-405B-Instruct-quantized.w8a16 Text Generation • Updated 28 days ago • 470 • 2
nm-testing/TinyLlama-1.1B-Chat-v1.0-W4A16_channel-e2e Text Generation • Updated about 7 hours ago • 871
nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A16_channel-e2e Text Generation • Updated about 7 hours ago • 892
nm-testing/TinyLlama-1.1B-Chat-v1.0-FP8A16_channel-e2e Text Generation • Updated about 9 hours ago • 31
nm-testing/TinyLlama-1.1B-Chat-v1.0-FP8A16_tensor-e2e Text Generation • Updated about 9 hours ago • 30
nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A8_tensor_weight_static_per_tensor_act-e2e Text Generation • Updated about 7 hours ago • 479