A-suozhang/MixDQ

A-suozhang / MixDQ

[ECCV24] MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization

☆14

Alternatives and similar repositories for MixDQ

Users who are interested in MixDQ are comparing it to the libraries listed below.

imagination-research / LCSC
[ICLR 2025] Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better
☆16 · Feb 15, 2025 · Updated last year
thu-nics / MixDQ
[ECCV24] MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization
☆50 · Nov 27, 2024 · Updated last year
A-suozhang / ViDiT-Q
☆15 · Mar 21, 2025 · Updated last year
RunpeiDong / DGMS
[ICML 2022 Spotlight] Finding the Task-Optimal Low-Bit Sub-Distribution in Deep Neural Networks
☆11 · May 21, 2023 · Updated 3 years ago
enyac-group / evol-q
Quantization in the Jagged Loss Landscape of Vision Transformers
☆13 · Oct 22, 2023 · Updated 2 years ago
Xingyu-Zheng / BinaryDM
(ICLR 2025) BinaryDM: Accurate Weight Binarization for Efficient Diffusion Models
☆25 · Oct 4, 2024 · Updated last year
GATECH-EIC / SuperTickets
[ECCV 2022] SuperTickets: Drawing Task-Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning
☆20 · Jul 7, 2022 · Updated 4 years ago
Intelligent-Computing-Lab-Panda / GPTAQ
Code implementation of GPTAQ (https://arxiv.org/abs/2504.02692)
☆93 · Jul 28, 2025 · Updated last year
diaoenmao / Pruning-Deep-Neural-Networks-from-a-Sparsity-Perspective
[ICLR 2023] Pruning Deep Neural Networks from a Sparsity Perspective
☆25 · Jan 8, 2024 · Updated 2 years ago
WeChatCV / UnderEraser
From Understanding to Erasing: Towards Complete and Stable Video Object Removal
☆29 · Apr 7, 2026 · Updated 3 months ago
Aaronhuang-778 / SliM-LLM
[ICML 2025] SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large Language Models
☆62 · Aug 9, 2024 · Updated last year
HuangOwen / QAT-ACS
[TMLR] Official PyTorch implementation of paper "Efficient Quantization-aware Training with Adaptive Coreset Selection"
☆39 · Aug 20, 2024 · Updated last year
tinganchen / AlignQ
[CVPR 2022] AlignQ: Alignment Quantization with ADMM-based Correlation Preservation
☆11 · Jan 6, 2023 · Updated 3 years ago
iszry / DI2N-PTQ4DM
Improved the performance of 8-bit PTQ4DM, especially on FID.
☆11 · Aug 30, 2023 · Updated 2 years ago
iamkanghyunchoi / falqon
Official repository of paper [FALQON: Accelerating LoRA Fine-tuning with Low-Bit Floating-Point Arithmetic, NeurIPS 2025]
☆21 · Dec 2, 2025 · Updated 7 months ago
haiquanlu / AlphaPruning
[NeurIPS 2024] AlphaPruning: Using Heavy-Tailed Self Regularization Theory for Improved Layer-wise Pruning of Large Language Models
☆34 · Jun 9, 2025 · Updated last year
ChengZhang-98 / LQER
Official implementation of ICML'24 paper "LQER: Low-Rank Quantization Error Reconstruction for LLMs"
☆19 · Jul 11, 2024 · Updated 2 years ago
LOG-postech / rethinking-LLM-pruning
[EMNLP 2024] Official implementation of "Rethinking Pruning Large Language Models: Benefits and Pitfalls of Reconstruction Error Minimiza…
☆28 · Feb 21, 2025 · Updated last year
thu-nics / MBQ
The code repository of "MBQ: Modality-Balanced Quantization for Large Vision-Language Models"
☆93 · Mar 17, 2025 · Updated last year
Qualcomm-AI-research / FP8-quantization
☆172 · Mar 9, 2023 · Updated 3 years ago
ziplab / PTQD
The official implementation of PTQD: Accurate Post-Training Quantization for Diffusion Models
☆103 · Mar 12, 2024 · Updated 2 years ago
nota-github / ERGO
ERGO (Efficient Reasoning & Guided Observation) is a large vision-language model trained with reinforcement learning on efficiency object…
☆19 · Feb 25, 2026 · Updated 5 months ago
ThisisBillhe / EfficientDM
[ICLR 2024 Spotlight] This is the official PyTorch implementation of "EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Di…
☆73 · Jun 4, 2024 · Updated 2 years ago
EEESlab / CMix-NN
CMix-NN: Mixed Low-Precision CNN Library for Memory-Constrained Edge Devices
☆52 · Mar 19, 2020 · Updated 6 years ago
Qualcomm-AI-research / transformer-quantization
☆212 · Nov 9, 2021 · Updated 4 years ago
wimh966 / QDrop
The official PyTorch implementation of the ICLR2022 paper, QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quan…
☆132 · Sep 23, 2025 · Updated 10 months ago
SteveTsui / ReBNN
☆12 · Nov 17, 2023 · Updated 2 years ago
iis-eth-zurich / hd_dvs
Integrating Event-based Dynamic Vision Sensors with Sparse Hyperdimensional Computing
☆13 · Jul 9, 2020 · Updated 6 years ago
imagination-research / EEP
Efficient Expert Pruning for Sparse Mixture-of-Experts Language Models: Enhancing Performance and Reducing Inference Costs
☆25 · Nov 11, 2025 · Updated 8 months ago
lliai / EMQ-series
[ICCV-2023] EMQ: Evolving Training-free Proxies for Automated Mixed Precision Quantization
☆29 · Dec 6, 2023 · Updated 2 years ago
ziliHarvey / Homographies-for-Plane-Detection-and-3D-Reconstruction
3D reconstruction and Plane detection using plane-to-plane homography constraints for uncalibrated image pair under Manhattan World Assum…
☆16 · Dec 2, 2019 · Updated 6 years ago
shalomma / PytorchBottleneck
Information Bottleneck in DNN with PyTorch
☆15 · Jul 6, 2023 · Updated 3 years ago
Hsu1023 / DuQuant
[NeurIPS 2024 Oral🔥] DuQuant: Distributing Outliers via Dual Transformation Makes Stronger Quantized LLMs.
☆187 · Apr 24, 2026 · Updated 3 months ago
Cornell-RelaxML / Hyperdimensional-Computing
Official implementation for the paper "Understanding Hyperdimensional Computing for Parallel Single-Pass Learning"
☆25 · Jun 10, 2023 · Updated 3 years ago
EEESlab / CMSIS_NN-INTQ
INT-Q Extension of the CMSIS-NN library for ARM Cortex-M target
☆18 · Jan 10, 2020 · Updated 6 years ago
xuke225 / EQ-Net
EQ-Net [ICCV 2023]
☆32 · Aug 15, 2023 · Updated 2 years ago
zer0int / CLIP-txt2img-diffusers-scripts
Example scripts for using [my] fine-tuned CLIP models with HuggingFace 🤗
☆13 · Sep 24, 2024 · Updated last year
ndexter / MLFA
Machine Learning Function Approximation: This code implements the fully-connected Deep Neural Network (DNN) architectures considered in t…
☆20 · Oct 27, 2020 · Updated 5 years ago
YitingQu / UnsafeBench
☆15 · Mar 5, 2026 · Updated 4 months ago