2016

On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima

Nitish Shirish Keskar, D. Mudigere, J. Nocedal, M. Smelyanskiy, P. T. P. Tang

citations

Cite Score

70

Citation Graph

Loading graph...

References [0]

Sort:
Filter:

No references match the current filters.

Cited by

4

papers in your library

Cites

0

papers in your library

Notes

Tags

Paper Aliases

No aliases