File size: 1,074 Bytes
751be9e 774a2c8 7b402da 824e6bf 774a2c8 ab93ca6 d012232 ac545ce ab93ca6 ac545ce |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 |
---
# dialogue-bart-large-chinese
This is a seq2seq model fine-tuned on several Chinese dialogue datasets, from bart-large-chinese.
# Datasets
We utilize 4 Chinese dialogue datasets from [LUGE](https://www.luge.ai/#/)
| | | |
| ---- | ---- | ---- |
| | Count | Domain |
| Chinese Persona Chat (CPC) | 23,000 | Open |
| LCCC | 11,987,759 | Open |
| Emotional STC (ESTC) | 899,207 | Open |
| KdConv | 3,000 | Movie, Music, Travel |
| | | |
# Example
```python
#!/usr/bin/env python3
from transformers import BertTokenizer, BartForConditionalGeneration
tokenizer = BertTokenizer.from_pretrained("HIT-TMG/dialogue-bart-large-chinese")
model = BartForConditionalGeneration.from_pretrained("HIT-TMG/dialogue-bart-large-chinese")
``` |