OpenGVLab / LLMPrune-BESA

BESA is a differentiable weight pruning technique for large language models.
13Updated 8 months ago

Related projects

Alternatives and complementary repositories for LLMPrune-BESA