pszemraj's picture
update ckpt with 6ish epochs of training with 1024 TOKENS as max output
9996867
raw
history blame
14 Bytes
global_step164