2021

Efficient Large-Scale Language Model Training on GPU clusters Using Megatron-Lm

D. Narayanan, M. Shoeybi, J. Casper, P. Legresley, M. Patwary, V. Korthikanti, D. Vainbrand, P. Kashinkunti, J. Bernauer, Bryan Catanzaro, A. Phanishayee, Matei Zaharia

citations

Citation Graph

Loading graph...

References [0]

Sort:
Filter:

No references match the current filters.

Cited by

1

papers in your library

Cites

0

papers in your library

Notes

Tags

Paper Aliases

No aliases