distil-whisper/distil-large-v3

Update README.md

#5

by reach-vb HF staff - opened Jun 7

base: refs/heads/main ← from: refs/pr/5


README.md +2 -0

@@ -424,6 +424,8 @@ Once a valid PyTorch version is installed, SDPA is activated by default. It can
 + model = AutoModelForSpeechSeq2Seq.from_pretrained(model_id, torch_dtype=torch_dtype, low_cpu_mem_usage=True, use_safetensors=True, attn_implementation="sdpa")
 ```
+For more information about how to use SDPA, refer to the [Transformers SDPA documentation](https://huggingface.co/docs/transformers/en/perf_infer_gpu_one#pytorch-scaled-dot-product-attention).
 #### Torch compile
 Coming soon...
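
For readers of this diff, the `attn_implementation="sdpa"` argument in the context lines can be chosen defensively. The following is a minimal sketch, not part of the README or this PR: the helper names `pick_attn_implementation` and `load_model` are hypothetical, and the check assumes that SDPA is available whenever `torch.nn.functional.scaled_dot_product_attention` exists (PyTorch 2.0 and later), falling back to `"eager"` otherwise.

```python
from types import SimpleNamespace


def pick_attn_implementation(torch_module) -> str:
    """Hypothetical helper: return "sdpa" if the given torch-like module exposes
    nn.functional.scaled_dot_product_attention, otherwise "eager"."""
    functional = getattr(getattr(torch_module, "nn", None), "functional", None)
    return "sdpa" if hasattr(functional, "scaled_dot_product_attention") else "eager"


def load_model(model_id: str = "distil-whisper/distil-large-v3"):
    """Hypothetical wrapper around the from_pretrained call shown in the diff.
    Needs torch and transformers installed and downloads the model weights."""
    import torch
    from transformers import AutoModelForSpeechSeq2Seq

    # Half precision on GPU, full precision on CPU (an assumption of this sketch).
    torch_dtype = torch.float16 if torch.cuda.is_available() else torch.float32
    return AutoModelForSpeechSeq2Seq.from_pretrained(
        model_id,
        torch_dtype=torch_dtype,
        low_cpu_mem_usage=True,
        use_safetensors=True,
        attn_implementation=pick_attn_implementation(torch),
    )


# Stand-in objects so the selection logic can be exercised without PyTorch.
fake_torch_with_sdpa = SimpleNamespace(
    nn=SimpleNamespace(
        functional=SimpleNamespace(scaled_dot_product_attention=lambda *a, **k: None)
    )
)
fake_torch_without_sdpa = SimpleNamespace(nn=SimpleNamespace(functional=SimpleNamespace()))
```

Calling `load_model()` is left to the reader because it requires the real libraries and network access; the stand-in objects only illustrate how the backend string is selected.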