pszemraj
update ckpt with 6ish epochs of training with 1024 TOKENS as max output
9996867
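# Ignore intermediate training checkpoint directories (any directory whose name starts with "checkpoint-")
checkpoint-*/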