Automatic Chain of Thought Prompting in Large Language Models • Paper 2210.03493 • Published Oct 7, 2022
Direct Preference Optimization: Your Language Model is Secretly a Reward Model • Paper 2305.18290 • Published May 29, 2023
Training language models to follow instructions with human feedback • Paper 2203.02155 • Published Mar 4, 2022
ReAct: Synergizing Reasoning and Acting in Language Models • Paper 2210.03629 • Published Oct 6, 2022
The Power of Scale for Parameter-Efficient Prompt Tuning • Paper 2104.08691 • Published Apr 18, 2021
Scaling Down to Scale Up: A Guide to Parameter-Efficient Fine-Tuning • Paper 2303.15647 • Published Mar 28, 2023
SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems • Paper 1905.00537 • Published May 2, 2019
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding • Paper 1804.07461 • Published Apr 20, 2018