Could you please share the training code and information on the dataset you've used to train this one?

by dchaplinsky - opened May 8, 2023

May 8, 2023

I'm keen to try to train more ukrainian models using wechsel on the corpus I recently released: https://aclanthology.org/2023.unlp-1.1/ and other data that I have at my disposal.

benjamin

Owner May 8, 2023

That's great!

I just made the code public for you: https://github.com/bminixhofer/ukrainian-wechsel-models
You can find the data and model preparation scripts + configs there. The run_clm.py and run_mlm.py scripts are iirc an unchanged copy of the Huggingface scripts from some time ago.
The training runs are here: https://wandb.ai/bminixhofer/ukrainian-nlp

Hope that helps!

dchaplinsky

May 9, 2023

Thanks for the prompt response, reading the source code now.

So glad you've been using lang-uk NER that we've created and open sourced :)

dchaplinsky changed discussion status to closed May 9, 2023

dchaplinsky

May 9, 2023

•

edited May 9, 2023

Oh, finally HF is back to normal.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment