thu-ml / TetraJet-MXFP4TrainingLinks

Pytorch implementation of "Oscillation-Reduced MXFP4 Training for Vision Transformers" on DeiT Model Pre-training
14Updated 2 weeks ago

Alternatives and similar repositories for TetraJet-MXFP4Training

Users that are interested in TetraJet-MXFP4Training are comparing it to the libraries listed below

Sorting: