LLM Security Notes (大模型安全笔记)
ROSE: Robust Selective Fine-tuning for Pre-trained Language Models
Last updated
1 year ago