huseinzol05's picture
Update README.md
adab7ef verified
|
raw
history blame
303 Bytes
metadata
library_name: transformers
language:
  - ms

Malaysian Mistral 64M on MLM task using 512 context length

Replicating https://github.com/McGill-NLP/llm2vec using https://huggingface.co/mesolitica/malaysian-mistral-64M-4096

WandB, https://wandb.ai/aisyahrazak/mistral-64M-mlm?nw=nwuseraisyahrazak