Introducing GPipe, an Open Source Library for Efficiently Training Large-scale Neural Network Models

1 · Google AI Research · March 4, 2019, 7:04 p.m.
Posted by Yanping Huang, Software Engineer, Google AI Deep neural networks (DNNs) have advanced many machine learning tasks, including speech recognition, visual recognition, and language processing. Recent advances by BigGan, Bert, and GPT2.0 have shown that ever-larger DNN models lead to better task performance and past progress in visual recognition tasks has also shown a strong correlation between the model size and classification accuracy. For example, the winner of the 2014 ImageNet visual...