thu-ml / TetraJet-MXFP4Training
View external linksLinks

Pytorch implementation of "Oscillation-Reduced MXFP4 Training for Vision Transformers" on DeiT Model Pre-training
36Jun 20, 2025Updated 7 months ago

Alternatives and similar repositories for TetraJet-MXFP4Training

Users that are interested in TetraJet-MXFP4Training are comparing it to the libraries listed below

Sorting:

Are these results useful?