Hao840/Awesome-Low-Precision-Training

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Hao840/Awesome-Low-Precision-Training)

Hao840 / Awesome-Low-Precision-Training

A collection of research papers on low-precision training methods

☆69

Alternatives and similar repositories for Awesome-Low-Precision-Training

Users that are interested in Awesome-Low-Precision-Training are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

thu-ml / TetraJet-MXFP4Training
View on GitHub
Pytorch implementation of "Oscillation-Reduced MXFP4 Training for Vision Transformers" on DeiT Model Pre-training
☆40May 4, 2026Updated 2 months ago
lose4578 / CircleRoPE
View on GitHub
☆15Sep 1, 2025Updated 10 months ago
jeffreyyu0602 / quantized-training
View on GitHub
☆35Dec 22, 2025Updated 7 months ago
NVlabs / COAT
View on GitHub
[ICLR 2025] COAT: Compressing Optimizer States and Activation for Memory-Efficient FP8 Training
☆263Aug 9, 2025Updated 11 months ago
thu-ml / Jetfire-INT8Training
View on GitHub
☆63Jul 21, 2024Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
LGrCo / L-GreCo
View on GitHub
AN EFFICIENT AND GENERAL FRAMEWORK FOR LAYERWISE-ADAPTIVE GRADIENT COMPRESSION
☆16Oct 27, 2023Updated 2 years ago
Intelligent-Computing-Lab-Panda / GPTAQ
View on GitHub
Code implementation of GPTAQ (https://arxiv.org/abs/2504.02692)
☆92Jul 28, 2025Updated 11 months ago
microsoft / microxcaling
View on GitHub
PyTorch emulation library for Microscaling (MX)-compatible data formats
☆358Updated this week
ModelTC / msbench
View on GitHub
A tool for model sparse based on torch.fx
☆13Jun 3, 2024Updated 2 years ago
C0-Design / MemoryFormer
View on GitHub
An implementation is provided here for the NeurIPS2024 paper "MemoryFormer : Minimize Transformer Computation by Removing Fully-Connected…
☆16Mar 24, 2026Updated 3 months ago
Hao840 / ADEM-VL
View on GitHub
PyTorch code for "ADEM-VL: Adaptive and Embedded Fusion for Efficient Vision-Language Tuning"
☆21Oct 28, 2024Updated last year
aiha-lab / MX-QLLM
View on GitHub
LLM Inference with Microscaling Format
☆35Nov 12, 2024Updated last year
suzukimain / auto_diffusers
View on GitHub
diffusers with search engine
☆12Jan 13, 2026Updated 6 months ago
seungrokj / ai_sprint_paris
View on GitHub
☆14Jul 5, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
facebookresearch / SS2_HRTF
View on GitHub
SS2 HRTF Dataset - Reality Labs Research Audio
☆18May 22, 2026Updated last month
haochengxi / Train_Transformers_with_INT4
View on GitHub
☆157Jun 22, 2023Updated 3 years ago
xunzhang1128 / Q-DiT4SR
View on GitHub
[ICML 2026] Q-DiT4SR: Exploration of Detail-Preserving Diffusion Transformer Quantization for Real-World Image Super-Resolution
☆19May 1, 2026Updated 2 months ago
Audio-WestlakeU / pytorch_lightning_template_for_beginners
View on GitHub
A pytorch template for beginners based on pytorch_lightning
☆50Feb 1, 2024Updated 2 years ago
Hao840 / vanillaKD
View on GitHub
PyTorch code and checkpoints release for VanillaKD: https://arxiv.org/abs/2305.15781
☆77Nov 21, 2023Updated 2 years ago
jy-yuan / KIVI
View on GitHub
[ICML 2024] KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache
☆419Nov 20, 2025Updated 8 months ago
mit-han-lab / fouroversix
View on GitHub
Code for the papers: “Four Over Six: More Accurate NVFP4 Quantization with Adaptive Block Scaling” and “Adaptive Block-Scaled Data Types”
☆198Apr 21, 2026Updated 3 months ago
Kai-Liu001 / Awesome-Model-Quantization
View on GitHub
This repository contains low-bit quantization papers from 2020 to 2026 on top conference.
☆193Jun 25, 2026Updated 3 weeks ago
yluo42 / SRVQ
View on GitHub
Spherical residual vector quantization (SRVQ)
☆31Aug 25, 2024Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
taishi-n / torchrir
View on GitHub
PyTorch-based room impulse response (RIR) simulation toolkit with dynamic scenes, GPU acceleration.
☆22Feb 18, 2026Updated 5 months ago
IST-DASLab / QuEST
View on GitHub
Work in progress.
☆80Nov 25, 2025Updated 7 months ago
abhibambhaniya / progressive_gradient_flow_nm_sparsity
View on GitHub
Implementation of NM sparsity recipe presented in the paper "Progressive Gradient Flow for Robust N:M Sparsity Training in Transformers".
☆11Feb 5, 2024Updated 2 years ago
IST-DASLab / FP-Quant
View on GitHub
☆114Feb 26, 2026Updated 4 months ago
AI9Stars / SpecMQuant
View on GitHub
Speculative Decoding Meets Quantization: Compatibility Evaluation and Hierarchical Framework Design
☆23May 29, 2025Updated last year
Qualcomm-AI-research / BayesianBits
View on GitHub
☆22Feb 11, 2022Updated 4 years ago
SobeyMIL / TVG
View on GitHub
code for "TVG: A Training-free Transition Video Generation Method with Diffusion Models"
☆50Aug 19, 2024Updated last year
dhy2000 / x86Course
View on GitHub
BUAA X86 汇编程序设计课程笔记和实验作业
☆32Jun 12, 2022Updated 4 years ago
t123yh / MIPSCPU
View on GitHub
A simple MIPS CPU for BUAA CO course (and now NSCSCC).
☆10May 15, 2021Updated 5 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
wafer-ai / chipbenchmark
View on GitHub
a platform for monitoring the chip situation
☆16Jul 19, 2025Updated last year
pierreHmbt / FedCP-QQ
View on GitHub
Federated Conformal Prediction with Quantile-of-Quantiles (FedCP-QQ)
☆11May 6, 2026Updated 2 months ago
Xtra-Computing / hacc_demo
View on GitHub
☆18Sep 25, 2025Updated 9 months ago
nbasyl / LLM-FP4
View on GitHub
The official implementation of the EMNLP 2023 paper LLM-FP4
☆225Dec 15, 2023Updated 2 years ago
IST-DASLab / Quartet
View on GitHub
☆127Mar 18, 2026Updated 4 months ago
JuanFMontesinos / torch_mir_eval
View on GitHub
Backpropagable pytorch implementation of https://craffel.github.io/mir_eval/.
☆35Jul 8, 2024Updated 2 years ago
JingyuanZhou / Task_Adaptive_Network
View on GitHub
☆11Nov 8, 2022Updated 3 years ago