metadata
duplicated_from: anton-l/common_voice_generator
Common Voice release generator
- Copy the latest release id from the `RELEASES` dict in https://github.com/common-voice/common-voice/blob/main/web/src/components/pages/datasets/releases.ts to the `VERSIONS` variable in `generate_datasets.py`.
- Copy the languages from https://github.com/common-voice/common-voice/blob/release-v1.78.0/web/locales/en/messages.ftl (replacing `release-v1.78.0` with the latest version tag) to the `languages.ftl` file.
- Run `python generate_datasets.py` to generate the dataset repos.
- Create the dataset repo on the Hugging Face Hub and push the generated files (shown here for `common_voice_11_0`):
cd ..
huggingface-cli repo create --type dataset --organization mozilla-foundation common_voice_11_0
git clone https://huggingface.co/datasets/mozilla-foundation/common_voice_11_0
cd common_voice_11_0
cp ../common_voice_generator/common_voice_11_0/* ./
git add . && git commit -m "Release" && git push
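
The release-specific parts of the steps above (the version tag in the `messages.ftl` URL and the repo name in the publish commands) can be parameterized. The following is a minimal sketch, not part of `generate_datasets.py`: the helper names `messages_url`, `parse_languages`, and `publish_commands` are hypothetical, and the parser assumes the language entries are plain single-line `code = Name` Fluent messages, which may not hold for every entry in the real file.

```python
import re
from typing import Dict, List

GITHUB_BLOB = "https://github.com/common-voice/common-voice/blob"


def messages_url(tag: str) -> str:
    """Build the messages.ftl URL for a release tag such as 'release-v1.78.0'."""
    return f"{GITHUB_BLOB}/{tag}/web/locales/en/messages.ftl"


def parse_languages(ftl_text: str) -> Dict[str, str]:
    """Collect single-line `code = Name` entries (assumed layout of the language list)."""
    entry = re.compile(r"^([A-Za-z0-9-]+)\s*=\s*(.+?)\s*$")
    languages: Dict[str, str] = {}
    for line in ftl_text.splitlines():
        # Skip Fluent comments (lines starting with '#') and blank lines.
        if line.startswith("#") or not line.strip():
            continue
        match = entry.match(line)
        if match:
            languages[match.group(1)] = match.group(2)
    return languages


def publish_commands(repo_name: str, org: str = "mozilla-foundation") -> List[str]:
    """Return the publish commands from this README with the repo name filled in."""
    return [
        "cd ..",
        f"huggingface-cli repo create --type dataset --organization {org} {repo_name}",
        f"git clone https://huggingface.co/datasets/{org}/{repo_name}",
        f"cd {repo_name}",
        f"cp ../common_voice_generator/{repo_name}/* ./",
        'git add . && git commit -m "Release" && git push',
    ]


if __name__ == "__main__":
    # Print the values for review; nothing is downloaded or executed here.
    print(messages_url("release-v1.78.0"))
    for command in publish_commands("common_voice_11_0"):
        print(command)
```

The sketch only builds strings, so the URL and commands can be checked before anything is run by hand. The `cd` lines only make sense when the printed commands are pasted into one shell session in order, starting from inside the `common_voice_generator` directory.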