How AI Training Scales

1 · OpenAI · Dec. 14, 2018, 4:08 p.m.
We've discovered that the gradient noise scale, a simple statistical metric, predicts the parallelizability of neural network training on a wide range of tasks....