2018

Training Millions of Personalized Dialogue Agents

Antoine Bordes

citations

Cite Score

16

AI summary

This paper introduces a large-scale persona-based dialogue dataset, built from REDDIT conversations with over 5 million personas and 700 million dialogues, and shows that training with personas improves end-to-end systems, achieving state-of-the-art results on the PERSONA-CHAT dataset via transfer learning.

Main Contributions

  • Introduces a large-scale persona-based dialogue dataset with 5 million personas and 700 million dialogues.
  • Demonstrates that training with personas improves the performance of end-to-end dialogue systems.
  • Achieves state-of-the-art results on the PERSONA-CHAT dataset through transfer learning from the new dataset.
  • Shows that pre-training on the new dataset leads to considerable improvement in performance.
  • Demonstrates the effectiveness of aligning answers with both the persona of the author and the context.

Abstract

Current dialogue systems are not very engaging for users, especially when trained end-to-end without relying on proactive reengaging scripted strategies. Zhang et al. (2018) showed that the engagement level of end-to-end dialogue models increases when conditioning them on text personas providing some personalized back-story to the model. However, the dataset used in (Zhang et al., 2018) is synthetic and of limited size as it contains around 1k different personas. In this paper we introduce a new dataset providing 5 million personas and 700 million persona-based dialogues. Our experiments show that, at this scale, training using personas still improves the performance of end-to-end systems. In addition, we show that other tasks benefit from the wide coverage of our dataset by fine-tuning our model on the data from (Zhang et al., 2018) and achieving state-of-the-art results.

Citation Graph

Loading graph...

References [13]

Sort:
Filter:

Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin - 2017

47 papers in library cite

S. Sukhbaatar, A. Szlam, Jason Weston, Rob Fergus - 2015

18 papers in library cite

I. V. Serban, A. Sordoni, Yoshua Bengio, Aaron Courville, J. Pineau - 2016

2 papers in library cite

S. Zhang, E. Dinan, J. Urbanek, A. Szlam, Douwe Kiela, Jason Weston - 2018

4 papers in library cite

Jeffrey Li, M. Galley, Chris Brockett, Jianfeng Gao, B. Dolan - 2016

1 paper in library cites

M. Ghazvininejad, Chris Brockett, M. W. Chang, B. Dolan, Jianfeng Gao, W. T. Yih, M. Galley - 2017

3 papers in library cite

J. Dodge, A. Gane, X. Zhang, Antoine Bordes, S. Chopra, A. Miller, A. Szlam, Jason Weston - 2015

4 papers in library cite

T. H. Wen, D. Vandyke, N. Mrksic, M. Gasic, L. M. R. Barahona, P. H. Su, S. Ultes, S. Young - 2016

3 papers in library cite

Antoine Bordes, Y. Lan Boureau, Jason Weston - 2016

2 papers in library cite

T. Young, E. Cambria, I. Chaturvedi, M. Huang, H. Zhou, S. Biswas - 2017

1 paper in library cites

Yining Yang, S. Yuan, D. Cer, S. Y. Kong, Noah Constant, P. Pilar, H. Ge, Y. H. Sung, B. Strope, R. Kurzweil - 2018

1 paper in library cites

Ryan Lowe, I. V. Serban, M. Noseworthy, L. Charlin, J. Pineau - 2016

1 paper in library cites

C. K. Joshi, F. Mi, B. Faltings - 2017

1 paper in library cites

Cited by

1

papers in your library

Cites

7

papers in your library

Read

on November 15, 2025

Your review

Tags

Paper Aliases

No aliases