Preliminary Program

Are We Summarizing the Right Way? A Survey of Dialogue Summarization Data Sets

Don Tuggener¹, Margot Mieskes², Jan Deriu¹, Mark Cieliebak¹
¹Zurich University of Applied Sciences, ²University of Applied Sciences, Darmstadt

Abstract

Dialogue summarization is a long-standing task in the field of NLP, and several data sets with dialogues and associated human-written summaries of different styles exist. However, it is unclear for which type of dialogue which type of summary is most appropriate. For this reason, we apply a linguistic model of dialogue types to derive matching summary items and NLP tasks. This allows us to map existing dialogue summarization data sets into this model and identify gaps and potential directions for future work. As part of this process, we also provide an extensive overview of existing dialogue summarization data sets.