Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
THUDM
/
chatglm-6b
like
2.83k
Follow
Knowledge Engineering Group (KEG) & Data Mining at Tsinghua University
1,210
Transformers
PyTorch
Chinese
English
chatglm
glm
thudm
custom_code
Inference Endpoints
arxiv:
2103.10360
arxiv:
2210.02414
arxiv:
2406.12793
Model card
Files
Files and versions
Community
109
Train
Deploy
Use this model
main
chatglm-6b
/
modeling_chatglm.py
Commit History
Fix position ids in 1d position encoding
a8ede82
zxdu20
commited on
Apr 27, 2023
Fix input embeds
35ca523
zxdu20
commited on
Apr 18, 2023
Change mask positions to batch
4de8efe
zxdu20
commited on
Apr 14, 2023
Add empty_init option
eb55ff0
zxdu20
commited on
Apr 13, 2023
Fix attention score on mps
cde457b
zxdu20
commited on
Apr 9, 2023
Fix LogitsProcessor using slim checkpoint (
#29
)
61eee50
zxdu20
bcol
commited on
Apr 8, 2023
Use gmask in first place
9324de7
zxdu20
commited on
Apr 6, 2023
Update code for slim
63ce1ba
zxdu20
commited on
Apr 6, 2023
fix typo in use_gmask (
#21
)
551a50e
zxdu20
fzhang
commited on
Apr 5, 2023
Fix position ids expand
f82b180
zxdu20
commited on
Apr 3, 2023
Fix generate
fb23542
zxdu20
commited on
Apr 2, 2023
Fix attention mask for prefix prompt
08bc851
zxdu20
commited on
Apr 1, 2023
No padding for chat function
4b7ffbf
zxdu20
commited on
Apr 1, 2023
Implement batch generation
cc96a22
zxdu20
commited on
Apr 1, 2023
Fix position id for training
11c270c
zxdu20
commited on
Mar 31, 2023
fix GLM6BBlock name typo (
#20
)
9c7416d
zxdu20
richbrain
commited on
Mar 31, 2023
Add support for loading quantized model
2e1be30
zxdu20
commited on
Mar 31, 2023
Use dynamic dtype for prompts
c949d03
zxdu20
commited on
Mar 30, 2023
Fix backward for quantization
0cfae21
zxdu20
commited on
Mar 30, 2023
Implement gradient checkpointing
aea6cef
zxdu20
commited on
Mar 30, 2023
Fix bugs
0564795
zxdu20
commited on
Mar 30, 2023
Add pad_token_id in config.json
2200e2b
zxdu20
commited on
Mar 29, 2023
Set ignore_index for CrossEntropyLoss
5c64357
zxdu20
commited on
Mar 29, 2023
Support batch training
8127ab6
zxdu20
commited on
Mar 29, 2023
Merge branch 'main' into dev_pt
fbda120
zxdu20
commited on
Mar 29, 2023
Add p-tuning v2
812f43f
zxdu20
commited on
Mar 29, 2023
Fix context length in get_position_ids
096f3de
zxdu20
commited on
Mar 28, 2023
Close CPU fusion on Mac
4a9b711
zxdu20
commited on
Mar 23, 2023
Fix Chinese punctuation
d2bbc82
zxdu20
commited on
Mar 22, 2023
Remove hardcode bos_token_id
2460dc2
zxdu20
commited on
Mar 19, 2023
Add support for streaming output
42095d4
zxdu20
commited on
Mar 19, 2023
Fix overflow in FP16
220f772
zxdu20
commited on
Mar 16, 2023
Set is_parallelizable to False
f9f74fd
zxdu20
commited on
Mar 15, 2023
Add logit processor for NaN or Inf scores
c3dece3
zxdu20
commited on
Mar 15, 2023
Fix default history argument
9d1509a
zxdu20
commited on
Mar 14, 2023
Add support for float32
d4832e8
zxdu20
commited on
Mar 14, 2023
Fix past_key_values
cd8041e
zxdu20
commited on
Mar 13, 2023
Add chatglm-6b
d11c6aa
Sengxian
commited on
Mar 13, 2023