Update README.md
README.md CHANGED
@@ -55,7 +55,7 @@ base_model: /workspace/models/Mistral-Small-Instruct-2409
 model_type: AutoModelForCausalLM
 tokenizer_type: AutoTokenizer
 
-hub_model_id: anthracite-
+hub_model_id: anthracite-org/magnum-v4-22b-r4
 hub_strategy: "all_checkpoints"
 push_dataset_to_hub:
 hf_use_auth_token: true
@@ -73,17 +73,17 @@ load_in_4bit: false
 strict: false
 
 datasets:
-  - path: anthracite-
+  - path: anthracite-org/c2_logs_32k_mistral-v3_v1.2_no_system
     type: custommistralv2v3
-  - path: anthracite-
+  - path: anthracite-org/kalo-opus-instruct-22k-no-refusal-no-system
     type: custommistralv2v3
-  - path: anthracite-
+  - path: anthracite-org/kalo-opus-instruct-3k-filtered-no-system
     type: custommistralv2v3
   - path: anthracite-org/nopm_claude_writing_fixed
     type: custommistralv2v3
-  - path: anthracite-
+  - path: anthracite-org/kalo_opus_misc_240827_no_system
     type: custommistralv2v3
-  - path: anthracite-
+  - path: anthracite-org/kalo_misc_part2_no_system
     type: custommistralv2v3
 #chat_template: mistral_v2v3
 shuffle_merged_datasets: true
@@ -151,12 +151,12 @@ We'd like to thank Recursal / Featherless for sponsoring the compute for this tr
 We would also like to thank all members of Anthracite who made this finetune possible.
 
 ## Datasets
-- [anthracite-
-- [anthracite-
-- [anthracite-
+- [anthracite-org/c2_logs_32k_mistral-v3_v1.2_no_system](https://huggingface.co/datasets/anthracite-org/c2_logs_32k_mistral-v3_v1.2_no_system)
+- [anthracite-org/kalo-opus-instruct-22k-no-refusal-no-system](https://huggingface.co/datasets/anthracite-org/kalo-opus-instruct-22k-no-refusal-no-system)
+- [anthracite-org/kalo-opus-instruct-3k-filtered-no-system](https://huggingface.co/datasets/anthracite-org/kalo-opus-instruct-3k-filtered-no-system)
 - [anthracite-org/nopm_claude_writing_fixed](https://huggingface.co/datasets/anthracite-org/nopm_claude_writing_fixed)
-- [anthracite-
-- [anthracite-
+- [anthracite-org/kalo_opus_misc_240827_no_system](https://huggingface.co/datasets/anthracite-org/kalo_opus_misc_240827_no_system)
+- [anthracite-org/kalo_misc_part2_no_system](https://huggingface.co/datasets/anthracite-org/kalo_misc_part2_no_system)
 
 ## Training
 The training was done for 2 epochs. We used 8x[H100s](https://www.nvidia.com/en-us/data-center/h100/) GPUs graciously provided by [Recursal AI](https://recursal.ai/) / [Featherless AI](https://featherless.ai/) for the full-parameter fine-tuning of the model.
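Since the point of this commit is that the dataset paths now point at resolvable Hub repos, a quick sanity check can confirm the rename. This is a minimal sketch, assuming the `datasets` library is installed and the repo is public; the path below is just one of the six entries in the updated config:

```python
from datasets import load_dataset

# Spot-check one of the renamed dataset paths from the updated config.
# Assumption: the repo is public; a gated repo would additionally need
# an HF token (e.g. via `huggingface-cli login`).
ds = load_dataset("anthracite-org/kalo_misc_part2_no_system")
print(ds)  # prints the available splits and their row counts
```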