Papperoni

2018

Training Millions of Personalized Dialogue Agents

Antoine Bordes

citations

Cite Score

AI summary

This paper introduces a large-scale persona-based dialogue dataset, built from REDDIT conversations with over 5 million personas and 700 million dialogues, and shows that training with personas improves end-to-end systems, achieving state-of-the-art results on the PERSONA-CHAT dataset via transfer learning.

Main Contributions

Introduces a large-scale persona-based dialogue dataset with 5 million personas and 700 million dialogues.
Demonstrates that training with personas improves the performance of end-to-end dialogue systems.
Achieves state-of-the-art results on the PERSONA-CHAT dataset through transfer learning from the new dataset.
Shows that pre-training on the new dataset leads to considerable improvement in performance.
Demonstrates the effectiveness of aligning answers with both the persona of the author and the context.

Abstract

Current dialogue systems are not very engaging for users, especially when trained end-to-end without relying on proactive reengaging scripted strategies. Zhang et al. (2018) showed that the engagement level of end-to-end dialogue models increases when conditioning them on text personas providing some personalized back-story to the model. However, the dataset used in (Zhang et al., 2018) is synthetic and of limited size as it contains around 1k different personas. In this paper we introduce a new dataset providing 5 million personas and 700 million persona-based dialogues. Our experiments show that, at this scale, training using personas still improves the performance of end-to-end systems. In addition, we show that other tasks benefit from the wide coverage of our dataset by fine-tuning our model on the data from (Zhang et al., 2018) and achieving state-of-the-art results.

Citation Graph

Loading graph...

References [13]

Sort:

Filter:

[1]Attention Is All You Need

Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin - 2017

47 papers in library cite

Google Scholar

I mean... it introduced Transformers!

[2]End-to-End Memory Networks

S. Sukhbaatar, A. Szlam, Jason Weston, Rob Fergus - 2015

18 papers in library cite

Google Scholar

This was so surprising! This is very similar to transformers and RAG. Who knew?!

[3]Building End-to-End Dialogue Systems Using Generative Hierarchical Neural Network Models

I. V. Serban, A. Sordoni, Yoshua Bengio, Aaron Courville, J. Pineau - 2016

2 papers in library cite

Google Scholar

Dialogue systems - something that became very important after that

[4]Personalizing Dialogue Agents: I Have a Dog, Do You Have Pets Too?

S. Zhang, E. Dinan, J. Urbanek, A. Szlam, Douwe Kiela, Jason Weston - 2018

4 papers in library cite

Google Scholar

Maybe the first time I have seen "agents" in a title

[5]A Persona-Based Neural Conversation Model

Jeffrey Li, M. Galley, Chris Brockett, Jianfeng Gao, B. Dolan - 2016

1 paper in library cites

Google Scholar

[6]A Knowledge-Grounded Neural Conversation Model

M. Ghazvininejad, Chris Brockett, M. W. Chang, B. Dolan, Jianfeng Gao, W. T. Yih, M. Galley - 2017

3 papers in library cite

Google Scholar

[7]Evaluating Prerequisite Qualities for Learning End-to-End Dialog Systems

J. Dodge, A. Gane, X. Zhang, Antoine Bordes, S. Chopra, A. Miller, A. Szlam, Jason Weston - 2015

4 papers in library cite

Google Scholar

I like these "prerequisite" papers

[8]A Network-Based End-to-End Trainable Task-Oriented Dialogue System

T. H. Wen, D. Vandyke, N. Mrksic, M. Gasic, L. M. R. Barahona, P. H. Su, S. Ultes, S. Young - 2016

3 papers in library cite

Google Scholar

[9]Learning End-to-End Goal-Oriented Dialog

Antoine Bordes, Y. Lan Boureau, Jason Weston - 2016

2 papers in library cite

Google Scholar

[10]Augmenting End-to-End Dialog Systems With Commonsense Knowledge

T. Young, E. Cambria, I. Chaturvedi, M. Huang, H. Zhou, S. Biswas - 2017

1 paper in library cites

Google Scholar

[11]Learning Semantic Textual Similarity From Conversations

Yining Yang, S. Yuan, D. Cer, S. Y. Kong, Noah Constant, P. Pilar, H. Ge, Y. H. Sung, B. Strope, R. Kurzweil - 2018

1 paper in library cites

Google Scholar

[12]On the Evaluation of Dialogue Systems With Next Utterance Classification

Ryan Lowe, I. V. Serban, M. Noseworthy, L. Charlin, J. Pineau - 2016

1 paper in library cites

Google Scholar

[13]Personalization in Goal-Oriented Dialog

C. K. Joshi, F. Mi, B. Faltings - 2017

1 paper in library cites

Google Scholar

Cited by

papers in your library

Cites

papers in your library

Read

on November 15, 2025

I think this is very derivative from other papers (which I haven't read yet). But overall is a good read.