Text Generation
Transformers
PyTorch
TensorBoard
Safetensors
bloom
Eval Results
text-generation-inference
Inference Endpoints
Younes Belkada commited on
Commit
9dadebd
1 Parent(s): 52a7843

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +7 -0
README.md CHANGED
@@ -139,6 +139,7 @@ Please see [the BLOOM training README](https://github.com/bigscience-workshop/bi
139
  * Sequence length of 2048 tokens used (see [BLOOM tokenizer](https://huggingface.co/bigscience/tokenizer), [tokenizer description](#tokenization))
140
 
141
  **Objective Function:** Cross Entropy with mean reduction (see [API documentation](https://pytorch.org/docs/stable/generated/torch.nn.CrossEntropyLoss.html#torch.nn.CrossEntropyLoss)).
 
142
 
143
  ### Compute infrastructure
144
  Jean Zay Public Supercomputer, provided by the French government (see [announcement](https://www.enseignementsup-recherche.gouv.fr/fr/signature-du-marche-d-acquisition-de-l-un-des-supercalculateurs-les-plus-puissants-d-europe-46733)).
@@ -371,6 +372,11 @@ Intentionally using the model for harm, violating [human rights](#human-rights),
371
 
372
  - Generating content without attribution to the model, as specified in the [RAIL License, Use Restrictions](https://huggingface.co/spaces/bigscience/license)
373
 
 
 
 
 
 
374
  ## Intended Users
375
 
376
  ### Direct Users
@@ -407,6 +413,7 @@ Intentionally using the model for harm, violating [human rights](#human-rights),
407
 
408
  ---
409
 
 
410
  # Risks and Limitations
411
  *This section identifies foreseeable harms and misunderstandings.*
412
 
 
139
  * Sequence length of 2048 tokens used (see [BLOOM tokenizer](https://huggingface.co/bigscience/tokenizer), [tokenizer description](#tokenization))
140
 
141
  **Objective Function:** Cross Entropy with mean reduction (see [API documentation](https://pytorch.org/docs/stable/generated/torch.nn.CrossEntropyLoss.html#torch.nn.CrossEntropyLoss)).
142
+
143
 
144
  ### Compute infrastructure
145
  Jean Zay Public Supercomputer, provided by the French government (see [announcement](https://www.enseignementsup-recherche.gouv.fr/fr/signature-du-marche-d-acquisition-de-l-un-des-supercalculateurs-les-plus-puissants-d-europe-46733)).
 
372
 
373
  - Generating content without attribution to the model, as specified in the [RAIL License, Use Restrictions](https://huggingface.co/spaces/bigscience/license)
374
 
375
+ ## Intermediate checkpoints
376
+
377
+ For academic (or any) usage, we published the intermediate checkpoints, corresponding to the model state at each 5000 steps. Please follow [this link](https://huggingface.co/bigscience/bloom-176-intermediate) to get these checkpoints.
378
+
379
+
380
  ## Intended Users
381
 
382
  ### Direct Users
 
413
 
414
  ---
415
 
416
+
417
  # Risks and Limitations
418
  *This section identifies foreseeable harms and misunderstandings.*
419