Removing NSFW Concepts from Vision-and-Language Models for Text-to-Image Retrieval and Generation



PreviousOnthe Robustness of Large Multimodal Models Against Image Adversarial AttacksNextSafety Fine-Tuning at (Almost) No Cost: ABaseline for Vision Large Language Models
Last updated