Assessing Large Language Models on climate information

We present a comprehensive evaluation framework, grounded in science communication principles, to analyze responses of Large Language Models to climate change topics.

Abstract

Understanding how climate change affects us and learning about available solutions are key steps toward empowering individuals and communities to mitigate and adapt to it. As Large Language Models (LLMs) rise in popularity, it is necessary to assess their capability in this domain. In this study, we present a comprehensive evaluation framework, grounded in science communication principles, to analyze LLM responses to climate change topics. Our framework emphasizes both the presentational and epistemological adequacy of answers, offering a fine-grained analysis of LLM generations. Spanning 8 dimensions, our framework discerns up to 30 distinct issues in model outputs. The task is a real-world example of a growing number of challenging problems where AI can complement and lift human performance. We introduce a novel and practical protocol for scalable oversight that uses AI Assistance and relies on raters with relevant educational backgrounds. We evaluate several recent LLMs and conduct a comprehensive analysis of the results, shedding light on both the potential and the limitations of LLMs in the realm of climate communication.

Figure 2: Results for all presentational and epistemological dimensions.

Figure 2: Results for all presentational and epistemological dimensions.

___________________________________________________________

Please cite as: Bulian, J., Schäfer, M. S., Amini, A., Lam, H., Ciaramita, M., Gaiarin, B., Huebscher, M. C., Buck, C., Mede, N. G., Leippold, M., & Strauss, N. (2023). Assessing Large Language Models on climate information. Preprint. https://doi.org/10.48550/arXiv.2310.02932

Link to preprint: https://arxiv.org/abs/2310.02932

Summary on social media: Thread on X

Niels G. Mede
Niels G. Mede
Science Communication Researcher

I am a Senior Research and Teaching Associate at the Department of Communication and Media Research (IKMZ) of the University of Zurich, where I also completed a PhD in communication studies. My work focuses on science communication, public opinion, populism, digital media, climate change communication, and survey methodology. In 2022 and 2023, I was a visiting researcher at the Department of Life Sciences Communication of the University of Wisconsin—Madison and the Oxford Internet Institute. In June 2024, I will join the Digital Media Research Centre of the Queensland University of Technology as a visiting scholar.