status: skimmed¶
- MIXED PRECISION TRAINING — 2026, skimmed
- Scaling Laws for Neural Language Models — 2020, skimmed
- Very Deep Convolutional Networks for Large-Scale Image Recognition — 2015, skimmed
- Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation — 2014, skimmed