Could you please share the training code and information on the dataset you've used to train this one?

#2
by dchaplinsky - opened

I'm keen to train more Ukrainian models using WECHSEL on the corpus I recently released (https://aclanthology.org/2023.unlp-1.1/) and on other data I have at my disposal.

That's great!

I just made the code public for you: https://github.com/bminixhofer/ukrainian-wechsel-models
You can find the data and model preparation scripts and configs there. The run_clm.py and run_mlm.py scripts are, iirc, an unchanged copy of the Hugging Face scripts from some time ago.
The training runs are here: https://wandb.ai/bminixhofer/ukrainian-nlp

Hope that helps!

Thanks for the prompt response, reading the source code now.

So glad you've been using the lang-uk NER that we created and open-sourced :)

dchaplinsky changed discussion status to closed

Oh, finally HF is back to normal.
