jmluu/Awesome-Efficient-Training

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jmluu/Awesome-Efficient-Training)

jmluu / Awesome-Efficient-Training

A collection of research papers on efficient training of DNNs

☆69

Alternatives and similar repositories for Awesome-Efficient-Training

Users that are interested in Awesome-Efficient-Training are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

GATECH-EIC / CPT
View on GitHub
[ICLR 2021 Spotlight] "CPT: Efficient Deep Neural Network Training via Cyclic Precision" by Yonggan Fu, Han Guo, Meng Li, Xin Yang, Yinin…
☆31Mar 2, 2024Updated 2 years ago
chuliang007 / resnet20_training
View on GitHub
☆11Aug 2, 2024Updated last year
wangmaolin / niti
View on GitHub
Implementation of "NITI: Training Integer Neural Networks Using Integer-only Arithmetic" on arxiv
☆92Jul 26, 2022Updated 4 years ago
glassroom / heinsen_attention
View on GitHub
Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)
☆25Jun 6, 2024Updated 2 years ago
Ryu1845 / hyena-jax
View on GitHub
Implementation of Hyena Hierarchy in JAX
☆10Apr 30, 2023Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Doraemonzzz / nanoTransNormer
View on GitHub
☆11Oct 11, 2023Updated 2 years ago
PolyArch / fp-diannao
View on GitHub
☆14Apr 8, 2025Updated last year
sfox14 / block_minifloat
View on GitHub
Training with Block Minifloat number representation
☆18May 2, 2021Updated 5 years ago
ehw-fit / tf-approximate
View on GitHub
Approximate layers - TensorFlow extension
☆27Apr 14, 2025Updated last year
Doraemonzzz / xmixers
View on GitHub
Xmixers: A collection of SOTA efficient token/channel mixers
☆29Sep 4, 2025Updated 10 months ago
ntampouratzis / FPGA-based-LSTM
View on GitHub
A novel FPGA-based intent recognition systemutilizing deep recurrent neural networks
☆26Aug 25, 2021Updated 4 years ago
ASU-ESIC-FAN-Lab / RepNet
View on GitHub
☆13Jul 3, 2025Updated last year
A-suozhang / awesome-quantization-and-fixed-point-training
View on GitHub
Neural Network Quantization & Low-Bit Fixed Point Training For Hardware-Friendly Algorithm Design
☆161Dec 18, 2020Updated 5 years ago
charbel-sakr / Fixed-Point-Training
View on GitHub
Code needed to reproduce results from my ICLR 2019 paper on fixed-point quantization of the backprop algorithm.
☆10Jan 24, 2019Updated 7 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
Tiiiger / QPyTorch
View on GitHub
Low Precision Arithmetic Simulation in PyTorch
☆289May 20, 2024Updated 2 years ago
ziplab / Mesa
View on GitHub
This is the official PyTorch implementation for "Mesa: A Memory-saving Training Framework for Transformers".
☆120Dec 12, 2021Updated 4 years ago
Zhen-Dong / HAWQ
View on GitHub
Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.
☆463May 15, 2023Updated 3 years ago
zysxmu / DFSQ
View on GitHub
super-resolution; post-training quantization; model compression
☆14Nov 10, 2023Updated 2 years ago
ThisisBillhe / torch_quantizer
View on GitHub
torch_quantizer is a out-of-box quantization tool for PyTorch models on CUDA backend, specially optimized for Diffusion Models.
☆25Mar 29, 2024Updated 2 years ago
comaniac / epoi
View on GitHub
Benchmark PyTorch Custom Operators
☆14Jul 6, 2023Updated 3 years ago
leimao / PyTorch-Static-Quantization
View on GitHub
PyTorch Static Quantization Example
☆41Apr 29, 2021Updated 5 years ago
da-steve101 / twn_generator
View on GitHub
Generate an FPGA design for a TWN
☆11Nov 4, 2019Updated 6 years ago
Ther-nullptr / Awesome-Transformer-Accleration
View on GitHub
Paper list for accleration of transformers
☆14Jul 1, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
bohanzhuang / Towards-Effective-Low-bitwidth-Convolutional-Neural-Networks
View on GitHub
This repository implements the paper "Effective Training of Convolutional Neural Networks with Low-bitwidth Weights and Activations"
☆20Aug 30, 2021Updated 4 years ago
boluoweifenda / WAGE
View on GitHub
Code example for the ICLR 2018 oral paper
☆150May 31, 2018Updated 8 years ago
e-dupuis / awesome-approximate-dnn
View on GitHub
Curated content for DNN approximation, acceleration ... with a focus on hardware accelerator and deployment
☆29May 15, 2024Updated 2 years ago
SingularityKChen / dl_accelerator
View on GitHub
Deep Learning Accelerator Based on Eyeriss V2 Architecture with custom RISC-V extended instructions
☆210Jun 25, 2020Updated 6 years ago
zxytim / arithmetic-encoding-compression
View on GitHub
☆11Apr 3, 2023Updated 3 years ago
parsa-epfl / HBFPEmulator
View on GitHub
ColTraIn HBFP Training Emulator
☆15Feb 16, 2023Updated 3 years ago
yxli2123 / LoSparse
View on GitHub
☆64Oct 17, 2023Updated 2 years ago
AI-Efficiency / Awesome-Model-Quantization
View on GitHub
A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are co…
☆2,417Jul 10, 2026Updated 3 weeks ago
berlino / seq_icl
View on GitHub
☆54May 20, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
HazyResearch / fly
View on GitHub
☆226Feb 21, 2023Updated 3 years ago
xshaun / sc22-ae
View on GitHub
☆15Nov 7, 2025Updated 8 months ago
SJTU-ECTL / VECBEE
View on GitHub
VECBEE: A Versatile Efficiency-Accuracy Configurable Batch Error Estimation Method for Greedy Approximate Logic Synthesis
☆13Mar 8, 2022Updated 4 years ago
qgwang-hust / GraSU
View on GitHub
A Fast Graph Update Library for FPGA-based Dynamic Graph Processing
☆10Dec 20, 2021Updated 4 years ago
harvard-cns / Harvard-CNS-Seminar
View on GitHub
Reading seminar in Harvard Cloud Networking and Systems Group
☆16Aug 29, 2022Updated 3 years ago
GATECH-EIC / ShiftAddNet
View on GitHub
[NeurIPS 2020] ShiftAddNet: A Hardware-Inspired Deep Network
☆74Nov 16, 2020Updated 5 years ago
hahnyuan / ASVD4LLM
View on GitHub
Activation-aware Singular Value Decomposition for Compressing Large Language Models
☆92Oct 22, 2024Updated last year