Hugging Face
Quentin Gallouédec
qgallouedec
35 followers · 29 following
https://gallouedec.com
QGallouedec
qgallouedec
AI & ML interests
None yet
Articles
Preference Optimization for Vision Language Models
Jul 10
•
41
Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent
Apr 22
•
78
Organizations
Papers
4
arxiv:2402.09844
arxiv:2402.03046
arxiv:2208.14928
arxiv:2106.13687
spaces
2
Sleeping
🚀
Autotrain
Sleeping
🏃
Bibtex Cleaner
models
680
qgallouedec/Qwen2-0.5B-OnlineDPO-AutoRM
Text Generation
•
Updated
12 days ago
•
8
qgallouedec/Qwen2.5-0.5B-Online-DPO-AutoRM
Updated
12 days ago
•
2
qgallouedec/Qwen2.5-0.5B-Online-DPO-PairRM
Updated
12 days ago
qgallouedec/tiny-Idefics2ForConditionalGeneration
Image-Text-to-Text
•
Updated
14 days ago
•
893
qgallouedec/tiny-T5ForConditionalGeneration
Text2Text Generation
•
Updated
15 days ago
•
2.16k
qgallouedec/tiny-Qwen2ForCausalLM
Text Generation
•
Updated
15 days ago
•
35k
qgallouedec/tiny-Phi3ForCausalLM
Text Generation
•
Updated
15 days ago
•
6
qgallouedec/tiny-MistralForCausalLM-0.3
Text Generation
•
Updated
15 days ago
•
27
qgallouedec/tiny-MistralForCausalLM-0.2
Text Generation
•
Updated
15 days ago
•
8
qgallouedec/tiny-MistralForCausalLM-0.1
Text Generation
•
Updated
15 days ago
•
25
datasets
67
qgallouedec/trl-metrics
Viewer
•
Updated
about 21 hours ago
•
58.1k
•
201
qgallouedec/prm800k
Viewer
•
Updated
Oct 1
•
41.2k
•
41
•
1
qgallouedec/ultrafeedback-prompt
Viewer
•
Updated
Sep 9
•
60.9k
•
31
qgallouedec/ultrafeedback-gpt-3.5-turbo-helpfulness
Viewer
•
Updated
Sep 9
•
16.6k
•
30
qgallouedec/lm-human-preferences-descriptiveness
Viewer
•
Updated
Sep 9
•
6.26k
•
33
qgallouedec/lm-human-preferences-sentiment
Viewer
•
Updated
Sep 9
•
6.26k
•
33
qgallouedec/tldr-preference
Viewer
•
Updated
Sep 9
•
179k
•
33
qgallouedec/tldr
Viewer
•
Updated
Sep 9
•
130k
•
35
qgallouedec/hh-rlhf-helpful-base
Viewer
•
Updated
Sep 5
•
46.2k
•
32
qgallouedec/hh-rlhf-helpful-base-trl-style
Viewer
•
Updated
Sep 5
•
46.2k
•
53