torch
transformers
huggingface_hub
gdown
pymupdf
unidecode
pdf2image
poppler-utils
python-dateutil
datasets
underthesea
accelerate
pytorch-crf==0.7.2
sklearn-crfsuite
scikit-learn
numpy
pandas
install-jdk
seaborn