mi-optimize is a versatile tool designed for the quantization and evaluation of large language models (LLMs). The library's seamless integration of various quantization methods and evaluation techniques empowers users to customize their approaches according to specific requirements and constraints, providing a high level of flexibility.
☆25Nov 28, 2024Updated last year
Alternatives and similar repositories for MI-optimize
Users that are interested in MI-optimize are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆51Mar 4, 2026Updated 3 months ago
- Pulp virtual platform☆24Jul 16, 2025Updated 11 months ago
- Official PyTorch implementation of "Evolving Search Space for Neural Architecture Search"☆12Aug 18, 2021Updated 4 years ago
- Official PyTorch implementation of Fast-MoCo☆16Feb 20, 2023Updated 3 years ago
- Any-Precision Deep Neural Networks (AAAI 2021)☆62May 2, 2020Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Information Bottleneck in DNN with PyTorch☆15Jul 6, 2023Updated 2 years ago
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron`☆12Jun 22, 2022Updated 3 years ago
- ☆19Jul 30, 2021Updated 4 years ago
- Improved the performance of 8-bit PTQ4DM expecially on FID.☆11Aug 30, 2023Updated 2 years ago
- CMix-NN: Mixed Low-Precision CNN Library for Memory-Constrained Edge Devices☆49Mar 19, 2020Updated 6 years ago
- ☆41Dec 15, 2022Updated 3 years ago
- C rewrite of a minimal Python JPEG decoder☆12Jan 2, 2019Updated 7 years ago
- BNG Image Format Implementation☆12Sep 19, 2020Updated 5 years ago
- Binary translation in Rust☆12Jun 22, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This is an implementation of YOLO using LSQ network quantization method.☆22Apr 13, 2022Updated 4 years ago
- ☆23Mar 15, 2024Updated 2 years ago
- ☆54Jul 18, 2024Updated last year
- ☆16Nov 25, 2022Updated 3 years ago
- 3D reconstruction and Plane detection using plane-to-plane homography constraints for uncalibrated image pair under Manhattan World Assum…☆16Dec 2, 2019Updated 6 years ago
- INT-Q Extension of the CMSIS-NN library for ARM Cortex-M target☆18Jan 10, 2020Updated 6 years ago
- A simple C++17 header-only library for generating SVG plots☆10Mar 17, 2024Updated 2 years ago
- Machine Learning Function Approximation: This code implements the fully-connected Deep Neural Network (DNN) architectures considered in t…☆20Oct 27, 2020Updated 5 years ago
- ☆20Mar 6, 2022Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- RISC-V Static Binary Translator☆18Mar 6, 2019Updated 7 years ago
- (ICLR 2025) BinaryDM: Accurate Weight Binarization for Efficient Diffusion Models☆25Oct 4, 2024Updated last year
- A profiling library for the Sega Dreamcast☆12Apr 21, 2026Updated last month
- Running massive simulations using RNNs on CPUs for building bots and all kinds of things.☆13Jun 13, 2021Updated 5 years ago
- Simple C library for safely handling utf8 strings☆16Nov 30, 2014Updated 11 years ago
- TensorRT is a C++ library for high performance inference on NVIDIA GPUs and deep learning accelerators.☆21Mar 7, 2024Updated 2 years ago
- Code for Neurips24 paper: QuaRot, an end-to-end 4-bit inference of large language models.☆514Nov 26, 2024Updated last year
- ☆30Sep 3, 2025Updated 9 months ago
- ☆10Jul 16, 2016Updated 9 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆12Aug 26, 2022Updated 3 years ago
- Methods of Self Calibration☆20Jun 10, 2019Updated 7 years ago
- Quantization in the Jagged Loss Landscape of Vision Transformers☆13Oct 22, 2023Updated 2 years ago
- manage my star project on github☆11Jul 23, 2020Updated 5 years ago
- FPGA简单入门☆12Nov 17, 2020Updated 5 years ago
- Binary Neural Network-based COVID-19 Face-Mask Wear and Positioning Predictor on Edge Devices☆12Jul 1, 2021Updated 4 years ago
- Official Implementation for [ICLR26] DefensiveKV: Taming the Fragility of KV Cache Eviction in LLM Inference☆45Mar 28, 2026Updated 2 months ago