Resources for ICML 2024 paper "Prompting4Debugging: Red-Teaming Text-to-Image Diffusion Models by Finding Problematic Prompts"
TrustSafeAI
community
AI & ML interests
Research Demos and Tools for Trustworthy and Safe AI Development and Deployment
spaces
8
Running
👀
LLM Physical Safety
LLM benchmark for Physical Safety
Running
⚡
NeuralFuse
Protect Model from Suffering Low-voltage-induced Bit Errors
Running
⚡
Attention Tracker Prompt Injection Detector
Attention Tracker: Prompt Injection Detector
Running
3
🦀
NCTV: Neural Clamping Toolkit and Visualization
Model-agnostic Toolkit for Neural Network Calibration
Running
🧠
GREAT Score
Running
7
🛡️
GradientCuff-Jailbreak-Defense
Demonstration of Gradient Cuff: A Jailbreak Defense