Addressing Large Language Models that Lie: Case Studies in Summarization | UCSB Center for Responsible Machine Learning

Kathleen R. McKeown

Columbia University

Friday, April 21, 2023 1:00 PM

Location

HENLEY HALL 1010

Event Registration

LECTURE: 1:00 PM, RECEPTION: 2:30 PM

Abstract: Text summarization is a sub-field within natural language processing that aims to automatically generate short, paragraph length summaries given an input document. Much of the work has been done on news, but there have also been efforts on many other genres, including journal articles, medical documents, email, dialog, legal documents and even creative texts such as novel chapters. The advent of large language models promises a new level of performance in summarization, enabling the generation of summaries that are far more fluent, coherent and relevant than was previously possible. However, they also introduce a major new problem: they wholly hallucinate facts out of thin air. They may incorrectly intermingle facts from the input, they may introduce facts that were not mentioned at all, and worse yet, they may even make up things that are not true in the real world. In this talk, I will discuss our work in characterizing the kinds of errors that can occur and methods that we have developed to help mitigate hallucination in language modeling approaches to text summarization for a variety of genres.

Bio: Kathleen R. McKeown is the Henry and Gertrude Rothschild Professor of Computer Science at Columbia University and the Founding Director of the Data Science Institute, serving as Director from 2012 to 2017. She is also an Amazon Scholar. In earlier years, she served as Department Chair (1998-2003) and as Vice Dean for Research for the School of Engineering and Applied Science (2010-2012). A leading scholar and researcher in the field of natural language processing, McKeown focuses her research on the use of data for societal problems; her interests include text summarization, question answering, natural language generation, social media analysis and multilingual applications. She has received numerous honors and awards, including 2023 IEEE Innovation in Societal Infrastructure Award, American Philosophical Society Elected member, American Academy of Arts and Science elected member, American Association of Artificial Intelligence Fellow, a Founding Fellow of the Association for Computational Linguistics and an Association for Computing Machinery Fellow. Early on she received the National Science Foundation Presidential Young Investigator Award, and a National Science Foundation Faculty Award for Women. In 2010, she won both the Columbia Great Teacher Award—an honor bestowed by the students—and the Anita Borg Woman of Vision Award for Innovation.

This event is cosponsored by the Center for Responsible Machine Learning, the Mellichamp Initiative in Mind and Machine Intelligence, and the Center for Information Technology and Society.