ibivibiv commited on
Commit
593993d
1 Parent(s): 4d723db

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +15 -12
README.md CHANGED
@@ -1,12 +1,17 @@
1
  ---
2
  library_name: transformers
3
- tags: []
 
 
 
 
4
  ---
5
 
6
- # Model Card for Model ID
7
 
8
- <!-- Provide a quick summary of what the model is/does. -->
9
 
 
10
 
11
 
12
  ## Model Details
@@ -18,12 +23,12 @@ tags: []
18
  This is the model card of a 🤗 transformers model that has been pushed on the Hub. This model card has been automatically generated.
19
 
20
  - **Developed by:** [More Information Needed]
21
- - **Funded by [optional]:** [More Information Needed]
22
- - **Shared by [optional]:** [More Information Needed]
23
- - **Model type:** [More Information Needed]
24
- - **Language(s) (NLP):** [More Information Needed]
25
- - **License:** [More Information Needed]
26
- - **Finetuned from model [optional]:** [More Information Needed]
27
 
28
  ### Model Sources [optional]
29
 
@@ -196,6 +201,4 @@ Carbon emissions can be estimated using the [Machine Learning Impact calculator]
196
 
197
  ## Model Card Contact
198
 
199
- [More Information Needed]
200
-
201
-
 
1
  ---
2
  library_name: transformers
3
+ tags:
4
+ - moe
5
+ license: apache-2.0
6
+ language:
7
+ - en
8
  ---
9
 
10
+ # Model Card for Hydra MOE 120B
11
 
12
+ This is a 4x34b MOE model. I have been expirementing with various gate settings and models. This is yet another run with a different set of models on the same gate definitions.
13
 
14
+ I am using mixtral tooling.
15
 
16
 
17
  ## Model Details
 
23
  This is the model card of a 🤗 transformers model that has been pushed on the Hub. This model card has been automatically generated.
24
 
25
  - **Developed by:** [More Information Needed]
26
+ - **Funded by [optional]:** [ibivibiv](https://huggingface.co/ibivibiv) sadly alone and out of my pocket for now :D lol
27
+ - **Shared by [optional]:** [ibivibiv](https://huggingface.co/ibivibiv)
28
+ - **Model type:** MOE of 4 35B models (mistral/yi/llamma)
29
+ - **Language(s) (NLP):** English
30
+ - **License:** Apache 2.0
31
+ - **MOE'd from models :** jondurbin/nontoxic-bagel-34b-v0.2, 01-ai/Yi-34B, bhenrym14/platypus-yi-34b, abacusai/MM-OV-bagel-DPO-34b-c1000-250
32
 
33
  ### Model Sources [optional]
34
 
 
201
 
202
  ## Model Card Contact
203
 
204
+ [More Information Needed]