mengxiayu / LLMSuperWeight
Code for studying the super weight in LLMs
☆91 Updated 3 months ago
Alternatives and similar repositories for LLMSuperWeight:
Users who are interested in LLMSuperWeight are comparing it to the libraries listed below
- ☆116 Updated last month
- ☆125 Updated last year
- Cold Compress is a hackable, lightweight, and open-source toolkit for creating and benchmarking cache compression methods built on top of… ☆122 Updated 7 months ago
- The simplest implementation of recent Sparse Attention patterns for efficient LLM inference. ☆58 Updated last month
- Explorations into some recent techniques surrounding speculative decoding ☆248 Updated 2 months ago
- Understand and test language model architectures on synthetic tasks.