New AI tools have the potential to change the way workers perform and learn, but little is known about their impacts on the job. In this paper, we study the staggered introduction of a generative ...
Abstract: This paper presents an automated design method for a Strong-ARM dynamic comparator. The dynamic characteristics of the dynamic comparator are analyzed and fitted into static characteristics, ...
Abstract: Evaluating large language models (LLMs) presents unique challenges. While automatic side-by-side evaluation, also known as LLM-as-a-judge, has become a promising solution, model developers ...