⚡ WebGPU Benchmark Results M1 MBA (16.38x speedup)
#104
by
Reza2kn
- opened
Batch Size | WASM (int8) | WASM (fp16) | WASM (fp32) | WebGPU (fp16) | WebGPU (fp32) |
1 | 382.00 | 397.00 | 382.00 | 38.00 | 47.00 |
2 | 768.00 | 800.00 | 762.00 | 81.00 | 114.00 |
4 | 1533.00 | 1610.00 | 1541.00 | 110.00 | 175.00 |
8 | 3070.00 | 3260.00 | 3119.00 | 243.00 | 387.00 |
16 | 6204.00 | 6736.00 | 6413.00 | 430.00 | 710.00 |
32 | 12648.00 | 13763.00 | 12849.00 | 840.00 | 1592.00 |
- Model: Xenova/all-MiniLM-L6-v2
- Tests run: WASM (int8), WASM (fp16), WASM (fp32), WebGPU (fp16), WebGPU (fp32)
- Sequence length: 512
- Browser: Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/129.0.0.0 Safari/537.36
- GPU: vendor=apple, architecture=common-3, device=, description=