Discourse models with language models
- Event: Seminar
- Presented by: Jessy Li from the University of Texas at Austin
- Date: 04 June 2025
- Time: 13:15-15:00
- Venue: Gothenburg University, Humanisten and online
- Address: Renströmsgatan 6, 412 55 Göteborg
- Room: J411
- Zoom link: https://gu-se.zoom.us/j/69780476534?pwd=Q9Uw2lu0zda8MsXkL08eGrqU64DMpp.1
Abstract
How are sentences in a document connected, and why do they make the document feel “coherent”? Computational models of discourse aim to solve this myth by recovering the structural organization of texts, through which writers convey intent and meaning. In the first part of this talk, I will discuss our efforts on modeling human curiosity through question generation, and understanding its connection with discourse representations based on the linguistic theory of Questions Under Discussion. We show that LLMs, with design and training, resurface curiosity-driven questions and ground their elicitation and answers in text. Next, I will demonstrate how such generative discourse models can be used to measure discourse similarities in LLM-generated texts, as well as to derive explainable measures of information salience in LLMs using summarization as a behavioral probe.