Datorien Anderson's picture

8 11

Datorien Anderson

niltheory

·

https://ko-fi.com/occybyte

AI & ML interests

Deep Reinforcement Learning, Natural Language Processing

Organizations

None yet

niltheory's activity

upvoted 2 papers 2 months ago

HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal

Paper • 2402.04249 • Published Feb 6 • 3

Ferret: Faster and Effective Automated Red Teaming with Reward-Based Scoring Technique

Paper • 2408.10701 • Published Aug 20 • 10

upvoted 2 articles 2 months ago

Article

Introducing the Red-Teaming Resistance Leaderboard

Feb 23

• 12

Article

Red-Teaming Large Language Models

Feb 24, 2023

• 14

upvoted an article 4 months ago

Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

Jul 23

• 214

upvoted 3 papers 11 months ago

Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models

Paper • 2312.06585 • Published Dec 11, 2023 • 28

LLM360: Towards Fully Transparent Open-Source LLMs

Paper • 2312.06550 • Published Dec 11, 2023 • 56

Controllable Human-Object Interaction Synthesis

Paper • 2312.03913 • Published Dec 6, 2023 • 22