Update README.md
README.md
CHANGED
@@ -24,7 +24,7 @@ Qwen2.5 is the latest series of Qwen large language models. For Qwen2.5, we rele
 
 **This repo contains the instruction-tuned 72B Qwen2.5 model**, which has the following features:
 - Type: Causal Language Models
-- Training Stage: Pretraining
+- Training Stage: Pretraining & Post-training
 - Architecture: transformers with RoPE, SwiGLU, RMSNorm, and Attention QKV bias
 - Number of Parameters: 72.7B
 - Number of Paramaters (Non-Embedding): 70.0B