Commonsense UNcommonsense Reasoning: Abductive Reasoning about Uncommon Situations Paper • 2311.08469 • Published Nov 14, 2023 • 10
UNcommonsense Reasoning: Abductive Reasoning about Uncommon Situations Paper • 2311.08469 • Published Nov 14, 2023 • 10
Alignment MART: Improving LLM Safety with Multi-round Automatic Red-Teaming Paper • 2311.07689 • Published Nov 13, 2023 • 7 Trusted Source Alignment in Large Language Models Paper • 2311.06697 • Published Nov 12, 2023 • 10 Unveiling Safety Vulnerabilities of Large Language Models Paper • 2311.04124 • Published Nov 7, 2023 • 6
MART: Improving LLM Safety with Multi-round Automatic Red-Teaming Paper • 2311.07689 • Published Nov 13, 2023 • 7
Unveiling Safety Vulnerabilities of Large Language Models Paper • 2311.04124 • Published Nov 7, 2023 • 6