CausalLM
/

72B-preview-llamafied-qwen-llamafy

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

JosephusCheung commited on Nov 30, 2023

Commit

dbeb915

•

1 Parent(s): ac6ce97

Update README.md

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -3,6 +3,8 @@ license: gpl-3.0
 ---
 # A Chat Model
 Use the transformers library that does not require remote/external code to load the model, AutoModelForCausalLM and AutoTokenizer (or manually specify LlamaForCausalLM to load LM, GPT2Tokenizer to load Tokenizer), and model quantization should be fully compatible with GGUF (llama.cpp), GPTQ, and AWQ.
 *Do not use wikitext for recalibration.*

 ---
 # A Chat Model
+It should take an hour or so to be uploaded, and I am working on a gguf version.
 Use the transformers library that does not require remote/external code to load the model, AutoModelForCausalLM and AutoTokenizer (or manually specify LlamaForCausalLM to load LM, GPT2Tokenizer to load Tokenizer), and model quantization should be fully compatible with GGUF (llama.cpp), GPTQ, and AWQ.
 *Do not use wikitext for recalibration.*