2021
Efficient Large Scale Language Modeling With Mixtures of Experts
M. Artetxe, S. Bhosale, N. Goyal, T. Mihaylov, M. Ott, Sam Shleifer, X. V. Lin, J. Du, S. Iyer, R. Pasunuru, G. Anantharaman, Xiang Lisa Li, S. Chen, H. Akin, M. Baines, L. Martin, Xinyu Zhou, P. S. Koura, B. O'Horo, J. Wang, Luke Zettlemoyer, M. Diab, Z. Kozareva, Veselin Stoyanov