Adversarial Text Purification: A Large Language Model Approach for Defense