Qwen
/

Qwen2.5-72B-Instruct

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

jklj077 commited on Sep 17

Commit

b1faa85

•

1 Parent(s): 64fc727

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -24,7 +24,7 @@ Qwen2.5 is the latest series of Qwen large language models. For Qwen2.5, we rele
 **This repo contains the instruction-tuned 72B Qwen2.5 model**, which has the following features:
 - Type: Causal Language Models
-- Training Stage: Pretraining, SFT, DRPO
 - Architecture: transformers with RoPE, SwiGLU, RMSNorm, and Attention QKV bias
 - Number of Parameters: 72.7B
 - Number of Paramaters (Non-Embedding): 70.0B

 **This repo contains the instruction-tuned 72B Qwen2.5 model**, which has the following features:
 - Type: Causal Language Models
+- Training Stage: Pretraining & Post-training
 - Architecture: transformers with RoPE, SwiGLU, RMSNorm, and Attention QKV bias
 - Number of Parameters: 72.7B
 - Number of Paramaters (Non-Embedding): 70.0B