nblt / TWA

[ICLR 2023] Trainable Weight Averaging: Efficient Training by Optimizing Historical Solutions
27Updated last year

Related projects: