methods: LR-optimizations

Cramming: Training a Language Model on a Single GPU in One Day — 2022, to-read
How to Train State-Of-The-Art Models Using TorchVision’s Latest Primitives — 2021, deep-read