How AI Training Scales

0 · OpenAI · Dec. 14, 2018, 4:08 p.m.
Summary
We've discovered that the gradient noise scale, a simple statistical metric, predicts the parallelizability of neural network training on a wide range of tasks....