A collection of research papers on low-precision training methods
☆64May 10, 2025Updated 10 months ago
Alternatives and similar repositories for Awesome-Low-Precision-Training
Users that are interested in Awesome-Low-Precision-Training are comparing it to the libraries listed below
Sorting:
- Pytorch implementation of "Oscillation-Reduced MXFP4 Training for Vision Transformers" on DeiT Model Pre-training☆36Jun 20, 2025Updated 8 months ago
- A tool for model sparse based on torch.fx☆13Jun 3, 2024Updated last year
- Implement Code for UniMix and Bayias Compensated Loss☆19Mar 7, 2023Updated 3 years ago
- A framework to compare low-bit integer and float-point formats☆66Feb 6, 2026Updated last month
- ☆21Feb 11, 2022Updated 4 years ago
- Official implementation of the EMNLP23 paper: Outlier Suppression+: Accurate quantization of large language models by equivalent and opti…☆50Oct 21, 2023Updated 2 years ago
- torch_quantizer is a out-of-box quantization tool for PyTorch models on CUDA backend, specially optimized for Diffusion Models.☆24Mar 29, 2024Updated last year
- Code implementation of GPTAQ (https://arxiv.org/abs/2504.02692)☆83Jul 28, 2025Updated 7 months ago
- Work in progress.☆79Nov 25, 2025Updated 3 months ago
- Convert PyObject* to C++ datatypes and vice versa☆29Nov 20, 2016Updated 9 years ago
- ☆11Apr 22, 2020Updated 5 years ago
- This repository includes the official implementation of our paper "Grouping First, Attending Smartly: Training-Free Acceleration for Diff…☆55May 21, 2025Updated 9 months ago
- Pytorch implementation of our paper accepted by TPAMI 2023 — Lottery Jackpots Exist in Pre-trained Models☆35Jun 19, 2023Updated 2 years ago
- Numbeo Unofficial API☆15Oct 16, 2022Updated 3 years ago
- [ICLR 2025] COAT: Compressing Optimizer States and Activation for Memory-Efficient FP8 Training☆261Aug 9, 2025Updated 7 months ago
- [ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod…☆39Mar 11, 2024Updated last year
- NART = NART is not A RunTime, a deep learning inference framework.☆37Mar 2, 2023Updated 3 years ago
- ☆157Jun 22, 2023Updated 2 years ago
- [CVPR 2025 Highlight] FIMA-Q: Post-Training Quantization for Vision Transformers by Fisher Information Matrix Approximation☆26Jun 16, 2025Updated 8 months ago
- ☆98Feb 26, 2026Updated last week
- ☆14Jul 5, 2025Updated 8 months ago
- This is the official PyTorch implementation for "Sharpness-aware Quantization for Deep Neural Networks".☆44Nov 25, 2021Updated 4 years ago
- ☆11Oct 25, 2021Updated 4 years ago
- CreativeGAN: Editing Generative Adversarial Networks for Creative Design Synthesis☆12Jun 3, 2021Updated 4 years ago
- codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"☆10Dec 30, 2024Updated last year
- Transformer from scratch with einsum method☆11Jul 8, 2021Updated 4 years ago
- [ICLR 2025] RaSA: Rank-Sharing Low-Rank Adaptation☆10May 19, 2025Updated 9 months ago
- Swei Fist Sans-derived from Noto Sans CJK font family with a more concise & modern look. 獅尾詠春黑體基於思源黑體改造,擁有更加簡明現代化的字體家族。☆10Aug 3, 2021Updated 4 years ago
- ☆11Dec 30, 2021Updated 4 years ago
- ☆23Jul 11, 2025Updated 7 months ago
- ☆15Dec 2, 2025Updated 3 months ago
- Recommendation Model Implementation by using PyTorch☆10Nov 1, 2022Updated 3 years ago
- The official code for [ECCV2020] "HALO: Hardware-aware Learning to Optimize"☆10Mar 22, 2023Updated 2 years ago
- Please visit https://github.com/HKUSTDial/NL2SQL360 to get the official code!☆10Sep 1, 2024Updated last year
- Dataset for AAAI paper "Natural Language Inference in Context - Investigating Contextual Reasoning over Long Texts"☆11Nov 18, 2022Updated 3 years ago
- OpenAI ROS☆12Mar 7, 2019Updated 7 years ago
- Code for paper "Towards Efficient Pareto Set Approximation via Weight-Ensembling Mixture of Experts"☆11Sep 13, 2024Updated last year
- A fonticulously fast font builder☆13May 20, 2022Updated 3 years ago
- diffusers with search engine☆12Jan 13, 2026Updated last month