മലയാളം - English ULMFit translationmodel. (Working in Progress)

malayalam-ULMFit-Seq2Seq (Traslation model)

malayalam-ULMFit-Seq2Seq model is pre-trained on Malyalam_Language_Model_ULMFiT using fastai Language Model using fastai

Tokenized using Sentencepiece with a vocab size of 10000 the language model is upload to kaggle dataset

Usage

!pip install -Uqq huggingface_hub["fastai"]

from huggingface_hub import from_pretrained_fastai
learner = from_pretrained_fastai(repo_id)

original_xtext = 'കേൾക്കുന്ന എല്ലാ കാര്യങ്ങളും എനിക്കു മനസിലായില്ല'
original_ytext = 'I didnt understand all this'
predicted_text = learner.predict(original_xtext)
print(f'original text: {original_xtext}')
print(f'original answer: {original_ytext}')
print(f'predicted text: {predicted_text}')

Intended uses & limitations

It's not fine tuned to the state of the art accuracy

Training and evaluation data

Malayalam Samanantar Dataset - uploaded to kaggle with english - malayalam