Scaling Neural Networks with GPipe
