IST-DASLab / qmoe

Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".
262Updated last year

Related projects

Alternatives and complementary repositories for qmoe