padmalcom's picture
Create README.md
e8883a7
|
raw
history blame
413 Bytes
This language indendent wav2vec2 classification model is based on (this dataset)[https://github.com/deeplyinc/Nonverbal-Vocalization-Dataset]
Sound classes are:
- teeth-chattering
- teeth-grinding
- tongue-clicking
- nose-blowing
- coughing
- yawning
- throat clearing
- sighing
- lip-popping
- lip-smacking
- panting
- crying
- laughing
- sneezing
- moaning
- screaming
Inference can be seen in *inference.py*.