Are We Summarizing the Right Way? A Survey of Dialogue Summarization Data Sets

Don Tuggener¹, Margot Mieskes², Jan Deriu¹, Mark Cieliebak¹
¹Zurich University of Applied Sciences, ²University of Applied Sciences, Darmstadt


Abstract

Dialogue summarization is a long-standing task in NLP, and several data sets exist that pair dialogues with human-written summaries of different styles. However, it is unclear which type of summary is most appropriate for which type of dialogue. To address this, we apply a linguistic model of dialogue types to derive matching summary items and NLP tasks. This allows us to map existing dialogue summarization data sets into the model and to identify gaps and potential directions for future work. As part of this process, we also provide an extensive overview of existing dialogue summarization data sets.