mi-optimize is a tool for quantizing and evaluating large language models (LLMs). The library integrates several quantization methods and evaluation techniques, so users can tailor their approach to specific requirements and constraints.
☆25 · Nov 28, 2024 · Updated last year
Alternatives and similar repositories for MI-optimize
Users who are interested in MI-optimize are comparing it to the libraries listed below.
- ☆50 · Mar 4, 2026 · Updated 2 months ago
- Pulp virtual platform ☆24 · Jul 16, 2025 · Updated 10 months ago
- Information Bottleneck in DNN with PyTorch ☆15 · Jul 6, 2023 · Updated 2 years ago
- Optimizing the Deployment of Tiny Transformers on Low-Power MCUs ☆36 · Sep 2, 2024 · Updated last year
- Various test models in WNNX format. They can be viewed with `pip install wnetron && wnetron` ☆12 · Jun 22, 2022 · Updated 3 years ago
- Improved the performance of 8-bit PTQ4DM, especially on FID. ☆11 · Aug 30, 2023 · Updated 2 years ago
- [ECCV24] MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization ☆14 · Nov 27, 2024 · Updated last year
- CMix-NN: Mixed Low-Precision CNN Library for Memory-Constrained Edge Devices ☆49 · Mar 19, 2020 · Updated 6 years ago
- ☆42 · Dec 15, 2022 · Updated 3 years ago
- C rewrite of a minimal Python JPEG decoder ☆12 · Jan 2, 2019 · Updated 7 years ago
- BNG Image Format Implementation ☆12 · Sep 19, 2020 · Updated 5 years ago
- Binary translation in Rust ☆12 · Jun 22, 2020 · Updated 5 years ago
- This is an implementation of YOLO using LSQ network quantization method. ☆22 · Apr 13, 2022 · Updated 4 years ago
- ☆12 · Nov 17, 2023 · Updated 2 years ago
- ☆54 · Jul 18, 2024 · Updated last year
- INT-Q Extension of the CMSIS-NN library for ARM Cortex-M target ☆18 · Jan 10, 2020 · Updated 6 years ago
- 🖥️ a toy riscv emulator ☆14 · Oct 20, 2021 · Updated 4 years ago
- A simple C++17 header-only library for generating SVG plots ☆10 · Mar 17, 2024 · Updated 2 years ago
- ☆20 · Mar 6, 2022 · Updated 4 years ago
- Codebase for the Progressive Mixed-Precision Decoding paper. ☆19 · Jul 15, 2025 · Updated 10 months ago
- RISC-V instruction encoding/decoding ☆12 · Mar 22, 2023 · Updated 3 years ago
- RISC-V Static Binary Translator ☆18 · Mar 6, 2019 · Updated 7 years ago
- Evaluation Code repository for the paper "ModuLoRA: Finetuning 3-Bit LLMs on Consumer GPUs by Integrating with Modular Quantizers". (2023… ☆13 · Dec 5, 2023 · Updated 2 years ago
- (ICLR 2025) BinaryDM: Accurate Weight Binarization for Efficient Diffusion Models ☆25 · Oct 4, 2024 · Updated last year
- A profiling library for the Sega Dreamcast ☆12 · Apr 21, 2026 · Updated last month
- Asynchronous I/O framework for C with coroutine scheduling ☆16 · Jul 6, 2025 · Updated 10 months ago
- Simple C library for safely handling utf8 strings ☆16 · Nov 30, 2014 · Updated 11 years ago
- TensorRT is a C++ library for high performance inference on NVIDIA GPUs and deep learning accelerators. ☆21 · Mar 7, 2024 · Updated 2 years ago
- Code for Neurips24 paper: QuaRot, an end-to-end 4-bit inference of large language models. ☆509 · Nov 26, 2024 · Updated last year
- ☆30 · Sep 3, 2025 · Updated 8 months ago
- ☆10 · Jul 16, 2016 · Updated 9 years ago
- Lightweight C plotting library without special dependencies for Linux and Win ☆12 · Feb 19, 2021 · Updated 5 years ago
- Methods of Self Calibration ☆20 · Jun 10, 2019 · Updated 6 years ago
- Quantization in the Jagged Loss Landscape of Vision Transformers ☆13 · Oct 22, 2023 · Updated 2 years ago
- manage my star project on github ☆11 · Jul 23, 2020 · Updated 5 years ago
- Binary Neural Network-based COVID-19 Face-Mask Wear and Positioning Predictor on Edge Devices ☆12 · Jul 1, 2021 · Updated 4 years ago
- ☆23 · Jun 12, 2023 · Updated 2 years ago
- [CVPR 2022] AlignQ: Alignment Quantization with ADMM-based Correlation Preservation ☆11 · Jan 6, 2023 · Updated 3 years ago
- IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse ☆101 · Mar 14, 2026 · Updated 2 months ago