Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
fnlp
/
moss-rlhf-reward-model-7B-en
like
9
Follow
Fudan NLP
92
Chinese
llm
reward model
moss
rlhf
arxiv:
2307.04964
License:
agpl-3.0
Model card
Files
Files and versions
Community
1
New discussion
New pull request
Resources
PR & discussions documentation
Code of Conduct
Hub documentation
All
Discussions
Pull requests
View closed (0)
The code for training the reward model?
#1 opened about 1 year ago by
AndyWodecki