towhee.trainer.optimization.adafactorΒΆ

Classes

Adafactor

AdaFactor pytorch implementation as introduced in Adafactor: Adaptive Learning Rates with Sublinear Memory Cost https://arxiv.org/abs/1804.04235.