Studious Bob Fight Back Against Jailbreaking via Prompt Adversarial Tuning
PreviousAdversarial Text Purification: A Large Language Model Approach for DefenseNextRobust Prompt Optimization for Defending Language Models Against Jailbreaking Attacks
Last updated

