2021
Efficient Large-Scale Language Model Training on GPU clusters Using Megatron-Lm
D. Narayanan, M. Shoeybi, J. Casper, P. Legresley, M. Patwary, V. Korthikanti, D. Vainbrand, P. Kashinkunti, J. Bernauer, Bryan Catanzaro, A. Phanishayee, Matei Zaharia
Citation Graph
References [0]
No references match the current filters.
Cited by
1
papers in your library
Cites
0
Add to reading list
Notes
Tags
Paper Aliases
No aliases