The Computational Limits of Deep Learning

Deep learning's recent history has been one of achievement: from triumphing over humans in the game of Go to world-leading performance in image classification, speech recognition, translation, and other tasks. But this progress has come with a voracious appetite for computing power. This project catalogs the extent of that dependency, showing that progress across a wide variety of applications is strongly reliant on increases in computing power. Extrapolating this reliance forward reveals that progress along current lines is rapidly becoming economically, technically, and environmentally unsustainable. Continued progress in these applications will therefore require dramatically more computationally-efficient methods, which will have to come either from changes to deep learning or from a move to other machine learning methods.
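As a rough illustration of the kind of extrapolation involved, the sketch below fits a power-law trend between computing power and error rate (linear in log-log space) and projects the compute implied by a target error. All data points, names, and target values here are hypothetical placeholders for illustration, not figures from this project.

```python
import numpy as np

# Hypothetical (compute, error) observations; placeholders only,
# not real benchmark data from this project.
compute = np.array([1e15, 1e16, 1e17, 1e18])  # training FLOPs
error = np.array([0.30, 0.24, 0.19, 0.15])    # task error rate

# Fit a power law, error ~ a * compute^b, as a line in log-log space.
slope, intercept = np.polyfit(np.log10(compute), np.log10(error), deg=1)

def compute_needed(target_error: float) -> float:
    """Extrapolate the fitted trend to the compute implied by a target error."""
    return 10 ** ((np.log10(target_error) - intercept) / slope)

# Halving the best observed error shows how quickly compute demands grow.
print(f"Estimated FLOPs for 7.5% error: {compute_needed(0.075):.2e}")
```

Because such fitted slopes tend to be shallow (error falls slowly with compute), each further improvement multiplies the compute required, which is the source of the unsustainability described above.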

Benchmarks

Machine Translation on WMT2014 English-French

Transformer+BT (ADMIN init)
Noisy back-translation
mRASP+Fine-Tune
Transformer + R-Drop
Admin
BERT-fused NMT
MUSE (Parallel Multi-scale Attention)
T5
Local Joint Self-attention
Depth Growing

Want to contribute?

You have access to our database, where you can point out errors or suggest changes.

Go to database