S-Eval: Automatic and Adaptive Test Generation for Benchmarking Safety Evaluation of Large Language Models Paper • 2405.14191 • Published May 23, 2024