SqueezeAILab / open_source_projects
Open Source Projects from Pallas Lab
☆21 · Updated 4 years ago
Alternatives and similar repositories for open_source_projects
Users who are interested in open_source_projects are comparing it to the repositories listed below.
- ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization ☆110 · Updated last year
- The official PyTorch implementation of the NeurIPS2022 (spotlight) paper, Outlier Suppression: Pushing the Limit of Low-bit Transformer L… ☆48 · Updated 3 years ago
- A collection of research papers on efficient training of DNNs ☆69 · Updated 3 years ago
- ☆36 · Updated last year
- Flexible simulator for mixed precision and format simulation of LLMs and vision transformers. ☆51 · Updated 2 years ago
- ☆163 · Updated 2 years ago
- LLM Inference with Microscaling Format ☆31 · Updated 11 months ago
- The official implementation of the DAC 2024 paper GQA-LUT ☆20 · Updated 10 months ago
- ☆69 · Updated 3 months ago
- ☆15 · Updated 3 years ago
- Official implementation of the EMNLP23 paper: Outlier Suppression+: Accurate quantization of large language models by equivalent and opti… ☆47 · Updated 2 years ago
- ACL 2023 ☆39 · Updated 2 years ago
- llama INT4 cuda inference with AWQ ☆55 · Updated 9 months ago
- ☆24 · Updated 7 months ago
- Torch2Chip (MLSys, 2024) ☆54 · Updated 7 months ago
- [TCAD 2021] Block Convolution: Towards Memory-Efficient Inference of Large-Scale CNNs on FPGA ☆17 · Updated 3 years ago
- ☆23 · Updated last year
- BitPack is a practical tool to efficiently save ultra-low precision/mixed-precision quantized models.