Brett Winton: 通过在 AI 训练上花费更多的前期费用并为模型提供更多数据来减少端点设备上的 AI 推理负载。在本文中，Google 通过在约 4 倍的数据上花费约 50% 的更多计算，以约一半的推理计算负载获得更好的模型性能。 ai.googleblog.com/2021/12/more-e…

Posted on 2022-08-17

原推：Reduce AI inference load at the endpoint device by spending more upfront on AI training and feeding the model more data.

In this paper Google gets better model performance at ~half the inference compute load by spending ~50% more compute on ~4x the data.