apple / ml-batchquantLinks
☆21Updated 2 years ago
Alternatives and similar repositories for ml-batchquant
Users that are interested in ml-batchquant are comparing it to the libraries listed below
Sorting:
- This repository contains the official implementation for the ECCV'22 paper, "SPIN: An Empirical Evaluation on Sharing Parameters of Isotr…☆20Updated last year
- Self-Conditioning Pre-Trained Language Models, ICML 2022☆31Updated 3 years ago
- Export utility for unconstrained channel pruned models☆71Updated 2 years ago
- ☆42Updated 2 years ago
- Tune-Mode ConvBN Blocks For Efficient Transfer Learning☆17Updated last year
- Repository accompanying the Interspeech 2022 publication titled "Space-Efficient Representation of Entity-centric Query Language Models" …☆13Updated 2 years ago
- ☆23Updated 3 years ago
- DUET: 2D Structured and Approximately Equivariant Representations, ICML 2023☆18Updated 2 years ago
- ☆85Updated last year
- A light-weight implementation of ICCV2023 paper "Reinforce Data, Multiply Impact: Improved Model Accuracy and Robustness with Dataset Rei…☆79Updated last year
- Research publication code for "Forward Compatible Training for Large-Scale Embedding Retrieval Systems", CVPR 2022, and "FastFill: Effici…☆55Updated 2 years ago
- ☆206Updated 3 years ago
- Utility to test the performance of CoreML models.☆70Updated 5 years ago
- Code repo for the paper BiT Robustly Binarized Multi-distilled Transformer☆109Updated 2 years ago
- ☆15Updated last year
- ptq4vm official repository☆22Updated 3 months ago
- A block oriented training approach for inference time optimization.☆33Updated 10 months ago
- ☆26Updated last year
- ☆42Updated last year
- Dynamic Neural Architecture Search Toolkit☆30Updated 7 months ago
- Open Source Projects from Pallas Lab☆20Updated 3 years ago
- Efficient GPU kernels for mixed-precision Vision Transformers in Triton☆13Updated 2 months ago
- ☆152Updated 2 years ago
- Flexible simulator for mixed precision and format simulation of LLMs and vision transformers.☆51Updated 2 years ago
- [TMLR] Official PyTorch implementation of paper "Quantization Variation: A New Perspective on Training Transformers with Low-Bit Precisio…☆45Updated 9 months ago
- The official PyTorch implementation of the NeurIPS2022 (spotlight) paper, Outlier Suppression: Pushing the Limit of Low-bit Transformer L…☆47Updated 2 years ago
- ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization☆109Updated 9 months ago
- This is a collection of our research on efficient AI, covering hardware-aware NAS and model compression.☆83Updated 8 months ago
- [ICML 2023] This project is the official implementation of our accepted ICML 2023 paper BiBench: Benchmarking and Analyzing Network Binar…☆56Updated last year
- Evaluation Code repository for the paper "ModuLoRA: Finetuning 3-Bit LLMs on Consumer GPUs by Integrating with Modular Quantizers". (2023…☆13Updated last year