Upload folder using huggingface_hub (#6)
- a9b400412fcc2eef9d80ca929b2d47247e529a56aa32291fe9530e8eecfa2b2d (32f378a9251c8b8d5ed8ea856fb33a8661c7d53f)
- 95a470af1405b41a638ece1f871b79d66be40c746639060f015290c8bda1bb35 (6667f783d49bafbb42e991a73a1fe6e10f968314)
- README.md +4 -3
- model/optimized_model.pkl +2 -2
- plots.png +0 -0
README.md
CHANGED
@@ -37,16 +37,17 @@ metrics:
 ![image info](./plots.png)
 
 **Important remarks:**
-- The quality of the model output might slightly vary compared to the base model.
+- The quality of the model output might slightly vary compared to the base model.
 - These results were obtained on NVIDIA A100-PCIE-40GB with configuration described in config.json and are obtained after a hardware warmup. Efficiency results may vary in other settings (e.g. other hardware, image size, batch size, ...).
 - You can request premium access to more compression methods and tech support for your specific use-cases [here](https://z0halsaff74.typeform.com/pruna-access?typeform-source=www.pruna.ai).
+- Results mentioning "first" are obtained after the first run of the model. The first run might take more memory or be slower than the subsequent runs due to CUDA overheads.
 
 ## Setup
 
 You can run the smashed model with these steps:
 
-0. Check
-1. Install the `pruna-engine` available [here](https://pypi.org/project/pruna-engine/) on Pypi. It might take 15 minutes to install.
+0. Check that you have linux, python 3.10, and cuda 12.1.0 requirements installed. For cuda, check with `nvcc --version` and install with `conda install nvidia/label/cuda-12.1.0::cuda`.
+1. Install the `pruna-engine` available [here](https://pypi.org/project/pruna-engine/) on Pypi. It might take up to 15 minutes to install.
 ```bash
 pip install pruna-engine[gpu]==0.6.0 --extra-index-url https://pypi.nvidia.com --extra-index-url https://pypi.ngc.nvidia.com --extra-index-url https://prunaai.pythonanywhere.com/
 ```
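Taken together, steps 0 and 1 of the updated README amount to a short shell session. The sketch below is a minimal, hedged example assembled from the commands already quoted in the diff above; the `python --version` check and the expected outputs in the comments are assumptions added for convenience, not part of the commit.

```bash
# Step 0 (from the README diff): check the Linux / Python 3.10 / CUDA 12.1.0 prerequisites.
python --version   # expected: Python 3.10.x
nvcc --version     # expected: CUDA release 12.1
# If CUDA 12.1.0 is missing, the README suggests:
#   conda install nvidia/label/cuda-12.1.0::cuda

# Step 1 (from the README diff): install pruna-engine 0.6.0 with the extra index URLs.
pip install "pruna-engine[gpu]==0.6.0" \
  --extra-index-url https://pypi.nvidia.com \
  --extra-index-url https://pypi.ngc.nvidia.com \
  --extra-index-url https://prunaai.pythonanywhere.com/
```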
model/optimized_model.pkl
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:e4994bcc02256b651d7c2f578b3d86cdf082f8aaf6bfc01fa910a37e9a8b69cf
+size 2582665451
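The optimized_model.pkl change above is a Git LFS pointer update: the commit swaps the recorded sha256 and size (2582665451 bytes, roughly 2.6 GB), while the pickle itself lives in LFS storage. As a hedged sketch using standard git-lfs commands (the repository URL is a placeholder), fetching and verifying the actual file could look like:

```bash
# Fetch the real weights behind the LFS pointer; <repo-url> is a placeholder
# for this model repository's clone URL.
git lfs install                        # one-time git-lfs setup
git clone <repo-url> smashed-model
cd smashed-model
git lfs pull                           # downloads model/optimized_model.pkl (~2.6 GB)
sha256sum model/optimized_model.pkl    # should print e4994bcc02256b651d7c2f578b3d86cdf082f8aaf6bfc01fa910a37e9a8b69cf
```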
plots.png
CHANGED