mi-optimize is a tool for quantizing and evaluating large language models (LLMs). It integrates several quantization methods and evaluation techniques, so users can tailor their approach to specific requirements and constraints.
☆25Nov 28, 2024Updated last year
Alternatives and similar repositories for MI-optimize
Users that are interested in MI-optimize are comparing it to the libraries listed below.
- ☆49Mar 4, 2026Updated 2 months ago
- Pulp virtual platform☆24Jul 16, 2025Updated 9 months ago
- Official PyTorch implementation of "Evolving Search Space for Neural Architecture Search"☆12Aug 18, 2021Updated 4 years ago
- Official PyTorch implementation of Fast-MoCo☆16Feb 20, 2023Updated 3 years ago
- Any-Precision Deep Neural Networks (AAAI 2021)☆62May 2, 2020Updated 6 years ago
- Optimizing the Deployment of Tiny Transformers on Low-Power MCUs☆35Sep 2, 2024Updated last year
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron`☆12Jun 22, 2022Updated 3 years ago
- Improved the performance of 8-bit PTQ4DM especially on FID.☆11Aug 30, 2023Updated 2 years ago
- CMix-NN: Mixed Low-Precision CNN Library for Memory-Constrained Edge Devices☆49Mar 19, 2020Updated 6 years ago
- C rewrite of a minimal Python JPEG decoder☆12Jan 2, 2019Updated 7 years ago
- BNG Image Format Implementation☆12Sep 19, 2020Updated 5 years ago
- Binary translation in Rust☆12Jun 22, 2020Updated 5 years ago
- This is an implementation of YOLO using LSQ network quantization method.☆22Apr 13, 2022Updated 4 years ago
- ☆12Nov 17, 2023Updated 2 years ago
- ☆53Jul 18, 2024Updated last year
- 3D reconstruction and Plane detection using plane-to-plane homography constraints for uncalibrated image pair under Manhattan World Assum…☆16Dec 2, 2019Updated 6 years ago
- [CVPR 2024] Efficient Hyperparameter Optimization with Adaptive Fidelity Identification☆12Jul 12, 2024Updated last year
- INT-Q Extension of the CMSIS-NN library for ARM Cortex-M target☆18Jan 10, 2020Updated 6 years ago
- A simple C++17 header-only library for generating SVG plots☆10Mar 17, 2024Updated 2 years ago
- Machine Learning Function Approximation: This code implements the fully-connected Deep Neural Network (DNN) architectures considered in t…☆20Oct 27, 2020Updated 5 years ago
- Codebase for the Progressive Mixed-Precision Decoding paper.☆19Jul 15, 2025Updated 9 months ago
- Evaluation Code repository for the paper "ModuLoRA: Finetuning 3-Bit LLMs on Consumer GPUs by Integrating with Modular Quantizers". (2023…☆13Dec 5, 2023Updated 2 years ago
- (ICLR 2025) BinaryDM: Accurate Weight Binarization for Efficient Diffusion Models☆25Oct 4, 2024Updated last year
- Official Repo for SparseLLM: Global Pruning of LLMs (NeurIPS 2024)☆68Mar 27, 2025Updated last year
- A profiling library for the Sega Dreamcast☆12Apr 21, 2026Updated 2 weeks ago
- Hinton's Forward-Forward Algorithm for Deep Learning☆10Feb 6, 2023Updated 3 years ago
- Asynchronous I/O framework for C with coroutine scheduling☆16Jul 6, 2025Updated 10 months ago
- Simple C library for safely handling utf8 strings☆16Nov 30, 2014Updated 11 years ago
- Code for Neurips24 paper: QuaRot, an end-to-end 4-bit inference of large language models.☆506Nov 26, 2024Updated last year
- Lightweight C plotting library without special dependencies for Linux and Win☆12Feb 19, 2021Updated 5 years ago
- Official Implementation for CVPR'2025 "EVOS:Enhancing Implicit Neural Representations via Symmetric Power Transformation".☆17Apr 5, 2025Updated last year
- ☆12Aug 26, 2022Updated 3 years ago
- Methods of Self Calibration☆20Jun 10, 2019Updated 6 years ago
- Quantization in the Jagged Loss Landscape of Vision Transformers☆13Oct 22, 2023Updated 2 years ago
- A simple introduction to FPGA☆12Nov 17, 2020Updated 5 years ago
- manage my star project on github☆11Jul 23, 2020Updated 5 years ago
- SPU-PMD: Self-Supervised Point Cloud Upsampling via Progressive Mesh Deformation (CVPR 2024)☆13Nov 5, 2025Updated 6 months ago
- Binary Neural Network-based COVID-19 Face-Mask Wear and Positioning Predictor on Edge Devices☆12Jul 1, 2021Updated 4 years ago
- [CVPR 2022] AlignQ: Alignment Quantization with ADMM-based Correlation Preservation☆11Jan 6, 2023Updated 3 years ago