mlpen / LookupFFNLinks
☆21Updated last year
Alternatives and similar repositories for LookupFFN
Users that are interested in LookupFFN are comparing it to the libraries listed below
Sorting:
- The official PyTorch implementation of the NeurIPS2022 (spotlight) paper, Outlier Suppression: Pushing the Limit of Low-bit Transformer L…☆48Updated 2 years ago
- ☆59Updated last year
- ☆14Updated last year
- Pytorch implementation of our paper accepted by NeurIPS 2022 -- Learning Best Combination for Efficient N:M Sparsity☆17Updated 2 years ago
- Efficient Expert Pruning for Sparse Mixture-of-Experts Language Models: Enhancing Performance and Reducing Inference Costs☆18Updated 8 months ago
- SQUEEZED ATTENTION: Accelerating Long Prompt LLM Inference☆50Updated 8 months ago
- ☆21Updated last week
- ☆15Updated 2 years ago
- ☆67Updated last year
- A collection of research papers on efficient training of DNNs☆69Updated 3 years ago
- ☆44Updated last year
- ☆29Updated last year
- ☆42Updated 2 years ago
- Code for the AAAI 2024 Oral paper "OWQ: Outlier-Aware Weight Quantization for Efficient Fine-Tuning and Inference of Large Language Model…☆64Updated last year
- ☆33Updated last year
- ☆27Updated 8 months ago
- Code for ICML 2021 submission☆34Updated 4 years ago
- Low-Rank Llama Custom Training☆23Updated last year
- [ICML 2021] "Auto-NBA: Efficient and Effective Search Over the Joint Space of Networks, Bitwidths, and Accelerators" by Yonggan Fu, Yonga…