uanu2002 / JSQ

[ICML 2024] JSQ: Compressing Large Language Models by Joint Sparsification and Quantization
148 · Updated 3 weeks ago

Related projects

Alternatives and complementary repositories for JSQ