[C7] LLM-as-an-Interviewer: Beyond Static Testing Through Dynamic LLM Evaluation
Eunsu Kim, Juyoung Suk, Seungone Kim, Niklas Muennighoff, Dongkwan Kim, Alice Oh. Findings of the Association for Computational Linguistics: ACL 2025 (ACL-Findings 2025, Long), 2025
One-sentence Summary: We propose a new paradigm for evaluating large language models (LLMs), called LLM-as-an-Interviewer.