noneUsername's picture
Create README.md
7a34e9f verified
|
raw
history blame
1.04 kB
metadata
base_model:
  - anthracite-org/magnum-v4-22b

vllm (pretrained=/root/autodl-tmp/magnum-v4-22b-W8A8,add_bos_token=true,tensor_parallel_size=2,max_model_len=2048,dtype=bfloat16), gen_kwargs: (None), limit: 250.0, num_fewshot: 5, batch_size: auto

Tasks Version Filter n-shot Metric Value Stderr
gsm8k 3 flexible-extract 5 exact_match 0.844 ± 0.023
strict-match 5 exact_match 0.808 ± 0.025

vllm (pretrained=/root/autodl-tmp/magnum-v4-22b-W8A8,add_bos_token=true,tensor_parallel_size=2,max_model_len=2048,dtype=float16), gen_kwargs: (None), limit: 250.0, num_fewshot: 5, batch_size: auto

Tasks Version Filter n-shot Metric Value Stderr
gsm8k 3 flexible-extract 5 exact_match 0.840 ± 0.0232
strict-match 5 exact_match 0.804 ± 0.0252