Old Web
English
Sign In
Acemap
>
Paper
>
Training Deep Networks with Stochastic Gradient Normalized by Layerwise Adaptive Second Moments
Training Deep Networks with Stochastic Gradient Normalized by Layerwise Adaptive Second Moments
2019
Boris Ginsburg
Patrice Castonguay
Oleksii Hrinchuk
Oleksii Kuchaiev
Vitaly Lavrukhin
Ryan Leary
Jason Li
Huyen Nguyen
Yang Zhang
Jonathan M. Cohen
Keywords:
Normalization (statistics)
Applied mathematics
Mathematics
Correction
Source
Cite
Save
Machine Reading By IdeaReader
26
References
2
Citations
NaN
KQI
[]