Discourse models with language models

Event: Seminar
Presented by: Jessy Li from the University of Texas at Austin
Date: 04 June 2025
Time: 13:15-15:00
Venue: Gothenburg University, Humanisten and online
Address: Renströmsgatan 6, 412 55 Göteborg
Room: J411
Zoom link: https://gu-se.zoom.us/j/69780476534?pwd=Q9Uw2lu0zda8MsXkL08eGrqU64DMpp.1
Slides: Jessy Li 4.6.2025.pdf

Abstract

How are sentences in a document connected, and why do they make the document feel “coherent”? Computational models of discourse aim to solve this mystery by recovering the structural organization of texts, through which writers convey intent and meaning. In the first part of this talk, I will discuss our efforts on modeling human curiosity through question generation, and understanding its connection with discourse representations based on the linguistic theory of Questions Under Discussion. We show that LLMs, with design and training, resurface curiosity-driven questions and ground their elicitation and answers in text. Next, I will demonstrate how such generative discourse models can be used to measure discourse similarities in LLM-generated texts, as well as to derive explainable measures of information salience in LLMs using summarization as a behavioral probe.