Distilling Adversarial Prompts from Safety Benchmarks: Report for the Adversarial Nibbler Challenge

Last updated