2019

exBERT: A Visual Analysis Tool to Explore Learned Representations in Transformers Models

Ben Hoover, Hendrik Strobelt, Sebastian Gehrmann

citations

Cite Score

11

AI summary

This paper introduces EXBERT, an interactive visual analysis tool for Transformer models like BERT, that explores learned representations and attention mechanisms by matching user-specified inputs to similar contexts in a large annotated dataset, providing intuitive explanations of attention-head functions.

Main Contributions

  • Introduces EXBERT, an interactive visualization tool for exploring learned representations in Transformer models.
  • Combines static analysis with a dynamic and intuitive view of attention and internal representations.
  • Uses a nearest neighbor search of embeddings on an annotated corpus to provide insights.
  • Demonstrates applicability of EXBERT to the BERT model using the Wizard of Oz corpus.
  • Helps understand what information BERT encodes and how it uses attention by probing linguistic features and positional information.

Abstract

Large language models can produce powerful contextual representations that lead to improvements across many NLP tasks. Since these models are typically guided by a sequence of learned self attention mechanisms and may comprise undesired inductive biases, it is paramount to be able to explore what the attention has learned. While static analyses of these models lead to targeted insights, interactive tools are more dynamic and can help humans better gain an intuition for the model-internal reasoning process. We present EXBERT, an interactive tool named after the popular BERT language model, that provides insights into the meaning of the contextual representations by matching a human-specified input to similar contexts in a large annotated dataset. By aggregating the annotations of the matching similar contexts, EXBERT helps intuitively explain what each attention-head has learned.

Citation Graph

Loading graph...

References [23]

Sort:
Filter:

Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin - 2017

47 papers in library cite

Jacob Devlin, M. W. Chang, K. Lee, Kristina Toutanova - 2018

39 papers in library cite

Yibo Liu, M. Ott, N. Goyal, J. Du, M. Joshi, Deli Chen, Omer Levy, Martha Lewis, Luke Zettlemoyer, Veselin Stoyanov - 2019

17 papers in library cite

Alec Radford, Jeffrey Wu, Rewon Child, D. Luan, Dario Amodei, Ilya Sutskever - 2019

27 papers in library cite

Thomas Wolf - 2019

6 papers in library cite

R. Sennrich, B. Haddow, Alexandra Birch - 2016

22 papers in library cite

A. Wang, A. Singh, J. Michael, F. Hill, Omer Levy, Samuel R. Bowman - 2018

26 papers in library cite

Thomas Wolf, L. Debut, V. Sanh, J. Chaumond, C. Delangue, A. Moi, P. Cistac, T. Rault, R. Louf, M. Funtowicz, J. Davison, Sam Shleifer, P. V. Platen, C. Ma, Yacine Jernite, J. Plu, Chenfeng Xu, T. L. Scao, S. Gugger, M. Drame, Q. Lhoest, Alexander M. Rush - 2019

7 papers in library cite

A. Wang, Y. Pruksachatkun, Nikita Nangia, A. Singh, J. Michael, F. Hill, Omer Levy, Samuel R. Bowman - 2019

15 papers in library cite

J. Johnson, M. Douze, Hervé Jégou - 2017

4 papers in library cite

Nitish Shirish Keskar, B. Mccann, L. R. Varshney, Caiming Xiong, Richard Socher - 2019

4 papers in library cite

I. Tenney, P. Xia, Berlin Chen, A. Wang, A. Poliak, R. T. Mccoy, N. Kim, B. V. Durme, S. Bowman, Dipanjan Das, Ellie Pavlick - 2019

4 papers in library cite

A. Raganato, J. Tiedemann - 2018

2 papers in library cite

E. Voita, D. Talbot, F. Moiseev, R. Sennrich, T. Ivan - 2019

2 papers in library cite

I. Tenney, Dipanjan Das, Ellie Pavlick - 2019

2 papers in library cite

Hendrik Strobelt, Sebastian Gehrmann, H. Pfister, Alexander M. Rush - 2017

2 papers in library cite

J. Vig - 2019

1 paper in library cites

J. Vig, Yonatan Belinkov - 2019

1 paper in library cites

Shantanu Jain, B. C. Wallace - 2019

1 paper in library cites

S. Wiegreffe, Y. Pinter - 2019

1 paper in library cites

G. Brunner, Yibo Liu, D. Pascual, O. Richter, R. Wattenhofer - 2019

1 paper in library cites

K. Clark, U. Khandelwal, Omer Levy, Christopher D. Manning - 2019

1 paper in library cites

Hendrik Strobelt, Sebastian Gehrmann, M. Behrisch, A. Perer, H. Pfister, Alexander M. Rush - 2018

1 paper in library cites

Cited by

1

papers in your library

Cites

22

papers in your library

Read

on January 11, 2026

Your review

Tags

Paper Aliases

No aliases