Could you please share the training code and information on the dataset you've used to train this one?
#2
by
dchaplinsky
- opened
I'm keen to try to train more ukrainian models using wechsel on the corpus I recently released: https://aclanthology.org/2023.unlp-1.1/ and other data that I have at my disposal.
That's great!
I just made the code public for you: https://github.com/bminixhofer/ukrainian-wechsel-models
You can find the data and model preparation scripts + configs there. The run_clm.py
and run_mlm.py
scripts are iirc an unchanged copy of the Huggingface scripts from some time ago.
The training runs are here: https://wandb.ai/bminixhofer/ukrainian-nlp
Hope that helps!
Thanks for the prompt response, reading the source code now.
So glad you've been using lang-uk NER that we've created and open sourced :)
dchaplinsky
changed discussion status to
closed
Oh, finally HF is back to normal.