
punctual-bert-bg

To try the model out, visit the Zapetayko website.

Usage

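from transformers import pipeline

MODEL_ID = "auhide/punctual-bert-bg"

# Load the model and its tokenizer as a token-classification pipeline.
punctuate = pipeline("token-classification", model=MODEL_ID, tokenizer=MODEL_ID)
# The input sentence is deliberately written without commas.
punctuate("Човекът искащ безгрижно писане ме помоли да създам този модел.")
# Output: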
[{'entity': 'B-CMA',
  'score': 0.95041466,
  'index': 1,
  'word': '▁Човекът',
  'start': 0,
  'end': 7},
 {'entity': 'I-CMA',
  'score': 0.95229745,
  'index': 2,
  'word': '▁иска',
  'start': 7,
  'end': 12},
 {'entity': 'B-CMA',
  'score': 0.95945585,
  'index': 5,
  'word': '▁писане',
  'start': 23,
  'end': 30},
 {'entity': 'I-CMA',
  'score': 0.90768945,
  'index': 6,
  'word': '▁ме',
  'start': 30,
  'end': 33}]

B-CMA tags the token immediately before a comma, and I-CMA tags the token immediately after it.

Therefore, if we place the commas based on these tags, the result is:

"Човекът, искащ безгрижно писане, ме помоли да създам този модел."

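The comma placement described above can be sketched in a few lines of Python. This is only an illustration, not part of the model or of transformers: the helper name insert_commas is hypothetical, and it assumes that each prediction's 'end' value is a character offset into the input string, as in the output shown earlier. It uses only the B-CMA tags and extends each offset to the end of the word, in case the tag lands on a subword that is not the last piece of the word. The sample predictions are trimmed copies of the output above, so the snippet runs without downloading the model.

```python
def insert_commas(text, predictions):
    """Return `text` with a comma after every word that holds a B-CMA token.

    Hypothetical helper: assumes each prediction is a dict with an 'entity'
    label and an 'end' character offset into `text`.
    """
    positions = set()
    for pred in predictions:
        if pred["entity"] != "B-CMA":
            continue
        end = pred["end"]
        # Walk to the end of the word, since the tag may sit on a
        # subword that is not the word's final piece.
        while end < len(text) and not text[end].isspace():
            end += 1
        # Skip words that are already followed by a comma.
        if end > 0 and text[end - 1] == ",":
            continue
        positions.add(end)
    # Insert from right to left so earlier offsets stay valid.
    for pos in sorted(positions, reverse=True):
        text = text[:pos] + "," + text[pos:]
    return text


text = "Човекът искащ безгрижно писане ме помоли да създам този модел."
# Trimmed copy of the pipeline output above (only the fields used here).
predictions = [
    {"entity": "B-CMA", "end": 7},
    {"entity": "I-CMA", "end": 12},
    {"entity": "B-CMA", "end": 30},
    {"entity": "I-CMA", "end": 33},
]
print(insert_commas(text, predictions))
```

With the real pipeline, the call would be insert_commas(text, punctuate(text)). The sketch ignores the I-CMA tags and the confidence scores; a stricter variant could require a matching I-CMA on the following word or a minimum score before inserting a comma.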
Model size: 177M params (Safetensors, tensor type F32)