2019
Cite Score
11
AI summary
This paper introduces EXBERT, an interactive visual analysis tool for Transformer models like BERT, that explores learned representations and attention mechanisms by matching user-specified inputs to similar contexts in a large annotated dataset, providing intuitive explanations of attention-head functions.
Main Contributions
Abstract
Large language models can produce powerful contextual representations that lead to improvements across many NLP tasks. Since these models are typically guided by a sequence of learned self attention mechanisms and may comprise undesired inductive biases, it is paramount to be able to explore what the attention has learned. While static analyses of these models lead to targeted insights, interactive tools are more dynamic and can help humans better gain an intuition for the model-internal reasoning process. We present EXBERT, an interactive tool named after the popular BERT language model, that provides insights into the meaning of the contextual representations by matching a human-specified input to similar contexts in a large annotated dataset. By aggregating the annotations of the matching similar contexts, EXBERT helps intuitively explain what each attention-head has learned.
Citation Graph
References [23]
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin - 2017
47 papers in library cite
Jacob Devlin, M. W. Chang, K. Lee, Kristina Toutanova - 2018
39 papers in library cite
Yibo Liu, M. Ott, N. Goyal, J. Du, M. Joshi, Deli Chen, Omer Levy, Martha Lewis, Luke Zettlemoyer, Veselin Stoyanov - 2019
17 papers in library cite
Alec Radford, Jeffrey Wu, Rewon Child, D. Luan, Dario Amodei, Ilya Sutskever - 2019
27 papers in library cite
Thomas Wolf - 2019
6 papers in library cite
R. Sennrich, B. Haddow, Alexandra Birch - 2016
22 papers in library cite
A. Wang, A. Singh, J. Michael, F. Hill, Omer Levy, Samuel R. Bowman - 2018
26 papers in library cite
Thomas Wolf, L. Debut, V. Sanh, J. Chaumond, C. Delangue, A. Moi, P. Cistac, T. Rault, R. Louf, M. Funtowicz, J. Davison, Sam Shleifer, P. V. Platen, C. Ma, Yacine Jernite, J. Plu, Chenfeng Xu, T. L. Scao, S. Gugger, M. Drame, Q. Lhoest, Alexander M. Rush - 2019
7 papers in library cite
A. Wang, Y. Pruksachatkun, Nikita Nangia, A. Singh, J. Michael, F. Hill, Omer Levy, Samuel R. Bowman - 2019
15 papers in library cite
J. Johnson, M. Douze, Hervé Jégou - 2017
4 papers in library cite
Nitish Shirish Keskar, B. Mccann, L. R. Varshney, Caiming Xiong, Richard Socher - 2019
4 papers in library cite
I. Tenney, P. Xia, Berlin Chen, A. Wang, A. Poliak, R. T. Mccoy, N. Kim, B. V. Durme, S. Bowman, Dipanjan Das, Ellie Pavlick - 2019
4 papers in library cite
A. Raganato, J. Tiedemann - 2018
2 papers in library cite
E. Voita, D. Talbot, F. Moiseev, R. Sennrich, T. Ivan - 2019
2 papers in library cite
I. Tenney, Dipanjan Das, Ellie Pavlick - 2019
2 papers in library cite
Hendrik Strobelt, Sebastian Gehrmann, H. Pfister, Alexander M. Rush - 2017
2 papers in library cite
J. Vig - 2019
1 paper in library cites
J. Vig, Yonatan Belinkov - 2019
1 paper in library cites
Shantanu Jain, B. C. Wallace - 2019
1 paper in library cites
S. Wiegreffe, Y. Pinter - 2019
1 paper in library cites
G. Brunner, Yibo Liu, D. Pascual, O. Richter, R. Wattenhofer - 2019
1 paper in library cites
K. Clark, U. Khandelwal, Omer Levy, Christopher D. Manning - 2019
1 paper in library cites
Hendrik Strobelt, Sebastian Gehrmann, M. Behrisch, A. Perer, H. Pfister, Alexander M. Rush - 2018
1 paper in library cites
Cited by
1
papers in your library
Cites
22
papers in your library
Read
on January 11, 2026
Your review
Tags
Paper Aliases
No aliases