OpenSourceRonin's picture
Upload model Llama-3.1-Nemotron-70B-Instruct-HF-v8-k65536-65536-woft
1ac1173 verified
raw
history blame
171 Bytes
{
"attn_implementation": "flash_attention_2",
"bos_token_id": 128000,
"eos_token_id": [
128001,
128008,
128009
],
"transformers_version": "4.45.2"
}