Na0s/Mixtral-8x7B-v0.1-instruct-pruned-random-1-experts
Text Generation
Mixtral-8x7B-Instruct-v0.1 with experts pruned, following the paper "A Provably Effective Method for Pruning Experts in Fine-tuned Sparse MoEs".
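As a rough illustration of what removing experts from a sparse MoE layer involves, here is a minimal sketch on toy data. It assumes, based only on the "random-1-experts" part of the repository name, that one randomly chosen expert is dropped; the card itself does not confirm this. The function `prune_random_experts`, the placeholder weights, and the seed are hypothetical and are not taken from the repository, and the paper's own selection criterion is not reproduced here.

```python
import random


def prune_random_experts(router_weights, experts, num_to_prune, seed=0):
    """Drop `num_to_prune` randomly chosen experts and their router rows.

    router_weights: list with one row of gating weights per expert.
    experts: list of per-expert parameters (any objects), same length.
    Returns (kept_indices, pruned_router_weights, pruned_experts).
    """
    if len(router_weights) != len(experts):
        raise ValueError("router_weights and experts must have the same length")
    if not 0 <= num_to_prune < len(experts):
        raise ValueError("must keep at least one expert")

    rng = random.Random(seed)  # fixed seed so the choice is reproducible
    dropped = set(rng.sample(range(len(experts)), num_to_prune))
    kept = [i for i in range(len(experts)) if i not in dropped]

    # The router must lose the same rows as the expert list, otherwise
    # it would still assign tokens to experts that no longer exist.
    return kept, [router_weights[i] for i in kept], [experts[i] for i in kept]


# Toy layer: 8 experts (as in Mixtral-8x7B), hidden size 4, placeholder weights.
router = [[float(e + d) for d in range(4)] for e in range(8)]
experts = [f"expert_{e}" for e in range(8)]

kept, pruned_router, pruned_experts = prune_random_experts(
    router, experts, num_to_prune=1
)
print(len(pruned_experts), kept)
```

In a real checkpoint the same idea would apply per MoE layer, and the configured expert count would need to be reduced to match; those steps are left out of this sketch.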