OpenEval: Benchmarking Chinese LLMs across Capability, Alignment and Safety
PreviousHALLUSIONBENCH: An Advanced Diagnostic Suite for Entangled Language Hallucination and Visual IllusiNextToViLaG: Your Visual-Language Generative Model is Also An Evildoer
Last updated

