A list of papers, docs, codes about efficient AIGC. This repo is aimed to provide the info for efficient AIGC research, including language and vision, we are continuously improving the project. Welcome to PR the works (papers, repositories) that are missed by the repo.
☆206Feb 10, 2025Updated last year
Alternatives and similar repositories for Awesome-Efficient-AIGC
Users that are interested in Awesome-Efficient-AIGC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are co …☆2,396May 11, 2026Updated last month
- [ICLR 2024 Spotlight] This is the official PyTorch implementation of "EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Di…☆72Jun 4, 2024Updated 2 years ago
- [CVPR 2024 Highlight & TPAMI 2025] This is the official PyTorch implementation of "TFMQ-DM: Temporal Feature Maintenance Quantization for…☆110Sep 29, 2025Updated 8 months ago
- List of papers related to neural network quantization in recent AI conferences and journals.☆830Mar 27, 2025Updated last year
- (ICLR 2025) BinaryDM: Accurate Weight Binarization for Efficient Diffusion Models☆25Oct 4, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [ICML 2023] This project is the official implementation of our accepted ICML 2023 paper BiBench: Benchmarking and Analyzing Network Binar…☆56Mar 4, 2024Updated 2 years ago
- [ICML 2024 Oral] This project is the official implementation of our Accurate LoRA-Finetuning Quantization of LLMs via Information Retenti…☆65Apr 15, 2024Updated 2 years ago
- [ICML 2025] This is the official PyTorch implementation of "ZipAR: Accelerating Auto-regressive Image Generation through Spatial Locality…☆51Mar 25, 2025Updated last year
- A tool for model sparse based on torch.fx☆13Jun 3, 2024Updated 2 years ago
- This project is the official implementation of our accepted ICLR 2022 paper BiBERT: Accurate Fully Binarized BERT.☆89Jun 2, 2023Updated 3 years ago
- The official implementation of PTQD: Accurate Post-Training Quantization for Diffusion Models☆103Mar 12, 2024Updated 2 years ago
- Awesome LLM compression research papers and tools.☆1,848Feb 23, 2026Updated 4 months ago
- [EMNLP 2024 & AAAI 2026] A powerful toolkit for compressing large models including LLMs, VLMs, and video generative models.☆728May 14, 2026Updated last month
- Implementation of Post-training Quantization on Diffusion Models (CVPR 2023)☆146Apr 1, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A curated list for Efficient Large Language Models☆2,019Jun 17, 2025Updated last year
- [CVPR 2025] APHQ-ViT: Post-Training Quantization with Average Perturbation Hessian Based Reconstruction for Vision Transformers☆44Apr 7, 2025Updated last year
- [ICML 2024] BiLLM: Pushing the Limit of Post-Training Quantization for LLMs☆235Jan 11, 2025Updated last year
- BinaryViT: Pushing Binary Vision Transformers Towards Convolutional Models☆39Feb 4, 2024Updated 2 years ago
- The official implementation of the ICML 2023 paper OFQ-ViT☆39Oct 3, 2023Updated 2 years ago
- ☆12Jul 18, 2024Updated last year
- The official PyTorch implementation of the ICLR2022 paper, QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quan…☆131Sep 23, 2025Updated 9 months ago
- Official code for the paper "Examining Post-Training Quantization for Mixture-of-Experts: A Benchmark"☆30Jun 30, 2025Updated 11 months ago
- ☆15Mar 21, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ICCV 2023] Q-Diffusion: Quantizing Diffusion Models.☆376Mar 21, 2024Updated 2 years ago
- (NeurIPS 2024) BiDM: Pushing the Limit of Quantization for Diffusion Models☆22Nov 20, 2024Updated last year
- BGEMM-CUDA is a CUDA-based low-bit GEMM kernel library for efficient neural network inference. It implements optimized binary and ternary…☆20Aug 30, 2024Updated last year
- Awesome machine learning model compression research papers, quantization, tools, and learning material.☆545Sep 21, 2024Updated last year
- Minute-long video generation at 24FPS.☆68Mar 28, 2026Updated 3 months ago
- [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models☆1,662Jul 12, 2024Updated last year
- 📚 Collection of awesome generation acceleration resources.☆400Jul 7, 2025Updated 11 months ago
- [ECCV2024] VividDreamer: Invariant Score Distillation For Hyper-Realistic Text-to-3D Generation☆10Jul 4, 2024Updated last year
- This repository contains integer operators on GPUs for PyTorch.☆235Sep 29, 2023Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- PyTorch codes for "Iterative Token Evaluation and Refinement for Real-World Super-Resolution", AAAI 2024☆59Oct 23, 2024Updated last year
- ☆13Jun 16, 2024Updated 2 years ago
- ☆16Sep 12, 2023Updated 2 years ago
- This project is the official implementation of our accepted IEEE TPAMI paper Diverse Sample Generation: Pushing the Limit of Data-free Qu…☆15Feb 26, 2023Updated 3 years ago
- Code implementation of GPTAQ (https://arxiv.org/abs/2504.02692)☆92Jul 28, 2025Updated 11 months ago
- ☆19Feb 4, 2025Updated last year
- [ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization☆722Aug 13, 2024Updated last year