The number of image token (1) should be the same as in the number of provided images (1)

#52
by MucahitSancar - opened

[
{
"instruction": "<|begin_of_text|> Görseli inceleyin ve aşağıdaki soruya yanıt verin. <|end_of_text|>",
"image": "images/image_0.png",
"input": "<|image|> Tüm topları çıkarın. Kaç nesne kaldı?",
"output": "<|begin_of_text|> Cevap 7 <|end_of_text|>"
},
my dataset is like that json format.

I couldn't fine-tune it. The number of image token (1) should be the same as in the number of provided images (1) Even though all the image files are verified, I still get this error.

Same here

The tokenizer add the image token and you already have it in your text <|image|> remove it from your json data strip 😉

The type of picture must be jpg

Sign up or log in to comment