UnsafeBench: Benchmarking Image Safety Classifiers on Real-World and AI-Generated Images
PreviousS-Eval: Automatic and Adaptive Test Generation for Benchmarking Safety Evaluation of Large LanguageNextJailBreakV-28K: A Benchmark for Assessing the Robustness of MultiModal Large Language Models against
Last updated
