Edit model card

This model is the combined camembert-base model, with the pretrained lilt checkpoint from the paper "LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding", with the visual backbone built from the pretrained checkpoint "microsoft/dit-base".

Note: This model should be fine-tuned, and loaded with the modeling and config files from the branch improve-dit.

Original repository: https://github.com/jpWang/LiLT

To use it, it is necessary to fork the modeling and configuration files from the original repository, and load the pretrained model from the corresponding classes (LiLTRobertaLikeVisionConfig, LiLTRobertaLikeVisionForRelationExtraction, LiLTRobertaLikeVisionForTokenClassification, LiLTRobertaLikeVisionModel). They can also be preloaded with the AutoConfig/model factories as such:

from transformers import AutoModelForTokenClassification, AutoConfig, AutoModel

from path_to_custom_classes import (
    LiLTRobertaLikeVisionConfig,
    LiLTRobertaLikeVisionForRelationExtraction,
    LiLTRobertaLikeVisionForTokenClassification,
    LiLTRobertaLikeVisionModel
    )


def patch_transformers():
    AutoConfig.register("liltrobertalike", LiLTRobertaLikeVisionConfig)
    AutoModel.register(LiLTRobertaLikeVisionConfig, LiLTRobertaLikeVisionModel)
    AutoModelForTokenClassification.register(LiLTRobertaLikeVisionConfig, LiLTRobertaLikeVisionForTokenClassification)
    # etc...

To load the model, it is then possible to use:

# patch_transformers() must have been executed beforehand

tokenizer = AutoTokenizer.from_pretrained("camembert-base")
model = AutoModel.from_pretrained("manu/lilt-camembert-dit-base-hf")
model = AutoModelForTokenClassification.from_pretrained("manu/lilt-camembert-dit-base-hf") # to be fine-tuned on a token classification task
Downloads last month
17
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.