Trojan Detection in Large Language Models: Insights from The Trojan Detection Challenge
