Cite Score
16
AI summary
This paper introduces a large-scale persona-based dialogue dataset, built from REDDIT conversations with over 5 million personas and 700 million dialogues, and shows that training with personas improves end-to-end systems, achieving state-of-the-art results on the PERSONA-CHAT dataset via transfer learning.
Main Contributions
Abstract
Current dialogue systems are not very engaging for users, especially when trained end-to-end without relying on proactive reengaging scripted strategies. Zhang et al. (2018) showed that the engagement level of end-to-end dialogue models increases when conditioning them on text personas providing some personalized back-story to the model. However, the dataset used in (Zhang et al., 2018) is synthetic and of limited size as it contains around 1k different personas. In this paper we introduce a new dataset providing 5 million personas and 700 million persona-based dialogues. Our experiments show that, at this scale, training using personas still improves the performance of end-to-end systems. In addition, we show that other tasks benefit from the wide coverage of our dataset by fine-tuning our model on the data from (Zhang et al., 2018) and achieving state-of-the-art results.
Citation Graph
References [13]
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin - 2017
47 papers in library cite
S. Sukhbaatar, A. Szlam, Jason Weston, Rob Fergus - 2015
18 papers in library cite
I. V. Serban, A. Sordoni, Yoshua Bengio, Aaron Courville, J. Pineau - 2016
2 papers in library cite
S. Zhang, E. Dinan, J. Urbanek, A. Szlam, Douwe Kiela, Jason Weston - 2018
4 papers in library cite
Jeffrey Li, M. Galley, Chris Brockett, Jianfeng Gao, B. Dolan - 2016
1 paper in library cites
M. Ghazvininejad, Chris Brockett, M. W. Chang, B. Dolan, Jianfeng Gao, W. T. Yih, M. Galley - 2017
3 papers in library cite
J. Dodge, A. Gane, X. Zhang, Antoine Bordes, S. Chopra, A. Miller, A. Szlam, Jason Weston - 2015
4 papers in library cite
T. H. Wen, D. Vandyke, N. Mrksic, M. Gasic, L. M. R. Barahona, P. H. Su, S. Ultes, S. Young - 2016
3 papers in library cite
Antoine Bordes, Y. Lan Boureau, Jason Weston - 2016
2 papers in library cite
T. Young, E. Cambria, I. Chaturvedi, M. Huang, H. Zhou, S. Biswas - 2017
1 paper in library cites
Yining Yang, S. Yuan, D. Cer, S. Y. Kong, Noah Constant, P. Pilar, H. Ge, Y. H. Sung, B. Strope, R. Kurzweil - 2018
1 paper in library cites
Ryan Lowe, I. V. Serban, M. Noseworthy, L. Charlin, J. Pineau - 2016
1 paper in library cites
C. K. Joshi, F. Mi, B. Faltings - 2017
1 paper in library cites
Cited by
1
papers in your library
Cites
7
papers in your library
Read
on November 15, 2025
Your review
Tags
Paper Aliases
No aliases