mi-optimize is a versatile tool designed for the quantization and evaluation of large language models (LLMs). The library's seamless integration of various quantization methods and evaluation techniques empowers users to customize their approaches according to specific requirements and constraints, providing a high level of flexibility.
☆25Nov 28, 2024Updated last year
Alternatives and similar repositories for MI-optimize
Users that are interested in MI-optimize are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆42Mar 4, 2026Updated 3 weeks ago
- Pulp virtual platform☆24Jul 16, 2025Updated 8 months ago
- Official PyTorch implementation of "Evolving Search Space for Neural Architecture Search"☆12Aug 18, 2021Updated 4 years ago
- Official PyTorch implementation of Fast-MoCo☆16Feb 20, 2023Updated 3 years ago
- Information Bottleneck in DNN with PyTorch☆15Jul 6, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Optimizing the Deployment of Tiny Transformers on Low-Power MCUs☆33Sep 2, 2024Updated last year
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron`☆12Jun 22, 2022Updated 3 years ago
- ☆19Jul 30, 2021Updated 4 years ago
- Improved the performance of 8-bit PTQ4DM expecially on FID.☆11Aug 30, 2023Updated 2 years ago
- CMix-NN: Mixed Low-Precision CNN Library for Memory-Constrained Edge Devices☆49Mar 19, 2020Updated 6 years ago
- ☆42Dec 15, 2022Updated 3 years ago
- BNG Image Format Implementation☆12Sep 19, 2020Updated 5 years ago
- ☆53Jul 18, 2024Updated last year
- 3D reconstruction and Plane detection using plane-to-plane homography constraints for uncalibrated image pair under Manhattan World Assum…☆16Dec 2, 2019Updated 6 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- INT-Q Extension of the CMSIS-NN library for ARM Cortex-M target☆18Jan 10, 2020Updated 6 years ago
- 🖥️ a toy riscv emulator☆14Oct 20, 2021Updated 4 years ago
- A simple C++17 header-only library for generating SVG plots☆10Mar 17, 2024Updated 2 years ago
- ☆20Mar 6, 2022Updated 4 years ago
- Codebase for the Progressive Mixed-Precision Decoding paper.☆19Jul 15, 2025Updated 8 months ago
- Evaluation Code repository for the paper "ModuLoRA: Finetuning 3-Bit LLMs on Consumer GPUs by Integrating with Modular Quantizers". (2023…☆13Dec 5, 2023Updated 2 years ago
- (ICLR 2025) BinaryDM: Accurate Weight Binarization for Efficient Diffusion Models☆26Oct 4, 2024Updated last year
- Hinton's Forward-Forward Algorithm for Deep Learning☆11Feb 6, 2023Updated 3 years ago
- Asynchronous I/O framework for C with coroutine scheduling☆16Jul 6, 2025Updated 8 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Running massive simulations using RNNs on CPUs for building bots and all kinds of things.☆13Jun 13, 2021Updated 4 years ago
- Simple C library for safely handling utf8 strings☆16Nov 30, 2014Updated 11 years ago
- Code for Neurips24 paper: QuaRot, an end-to-end 4-bit inference of large language models.☆492Nov 26, 2024Updated last year
- TensorRT is a C++ library for high performance inference on NVIDIA GPUs and deep learning accelerators.☆20Mar 7, 2024Updated 2 years ago
- ☆28Sep 3, 2025Updated 6 months ago
- ☆10Jul 16, 2016Updated 9 years ago
- Lightweight C plotting library without special dependencies for Linux and Win☆12Feb 19, 2021Updated 5 years ago
- ☆25Mar 20, 2021Updated 5 years ago
- ☆12Aug 26, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Quantization in the Jagged Loss Landscape of Vision Transformers☆13Oct 22, 2023Updated 2 years ago
- Methods of Self Calibration☆20Jun 10, 2019Updated 6 years ago
- Binary Neural Network-based COVID-19 Face-Mask Wear and Positioning Predictor on Edge Devices☆12Jul 1, 2021Updated 4 years ago
- [CVPR 2022] AlignQ: Alignment Quantization with ADMM-based Correlation Preservation☆11Jan 6, 2023Updated 3 years ago
- ☆23Jun 12, 2023Updated 2 years ago
- The application is a end-user training and evaluation system for standard knowledge graph embedding models. It was developed to optimise …☆18Mar 12, 2026Updated 2 weeks ago
- Intel 8080 CPU emulator library in Rust☆20Nov 20, 2025Updated 4 months ago