wangmaolin / nitiLinks

Implementation of "NITI: Training Integer Neural Networks Using Integer-only Arithmetic" on arxiv

☆86

Alternatives and similar repositories for niti

Users that are interested in niti are comparing it to the libraries listed below

Sorting:

Tiiiger / QPyTorch
Low Precision Arithmetic Simulation in PyTorch
☆285Updated last year
jafermarq / WinogradAwareNets
Official implementation of "Searching for Winograd-aware Quantized Networks" (MLSys'20)
☆27Updated 2 years ago
hsharma35 / bitfusion
Simulator for BitFusion
☆102Updated 5 years ago
lucamocerino / Binary-Neural-Networks-PyTorch-1.0
BNNs (XNOR, BNN and DoReFa) implementation for PyTorch 1.0+
☆40Updated 2 years ago
yeshaokai / ADMM-NN
☆36Updated 6 years ago
cooooorn / Pytorch-XNOR-Net
XNOR-Net, with binary gemm and binary conv2d kernels, support both CPU and GPU.
☆86Updated 6 years ago
ehw-fit / tf-approximate
Approximate layers - TensorFlow extension
☆26Updated 5 months ago
IntelLabs / FP8-Emulation-Toolkit
PyTorch extension for emulating FP8 data formats on standard FP32 Xeon/GPU hardware.
☆111Updated 10 months ago
plumerai / rethinking-bnn-optimization
Implementation for the paper "Latent Weights Do Not Exist: Rethinking Binarized Neural Network Optimization"
☆74Updated 5 years ago
yanghr / BSQ
BSQ: Exploring Bit-Level Sparsity for Mixed-Precision Neural Network Quantization (ICLR 2021)
☆42Updated 4 years ago
itayhubara / CalibTIP
Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming
☆99Updated 4 years ago
sfox14 / block_minifloat
Training with Block Minifloat number representation
☆16Updated 4 years ago
jmluu / Awesome-Efficient-Training
A collection of research papers on efficient training of DNNs
☆69Updated 3 years ago
gilshm / sparq
Post-training sparsity-aware quantization
☆34Updated 2 years ago
cornell-zhang / dnn-quant-ocs
DNN quantization with outlier channel splitting (ICML'19)
☆113Updated 5 years ago
ChengZhang-98 / llm-mixed-q
Official implementation of EMNLP'23 paper "Revisiting Block-based Quantisation: What is Important for Sub-8-bit LLM Inference?"
☆23Updated last year
Qualcomm-AI-research / FP8-quantization
☆161Updated 2 years ago
SHI-Labs / Any-Precision-DNNs
Any-Precision Deep Neural Networks (AAAI 2021)
☆61Updated 5 years ago
mrusci / training-mixed-precision-quantized-networks
This repository containts the pytorch scripts to train mixed-precision networks for microcontroller deployment, based on the memory contr…
☆50Updated last year
enyac-group / NeuralPower
The code for paper: Neuralpower: Predict and deploy energy-efficient convolutional neural networks
☆21Updated 6 years ago
EECS-NTNU / bismo
BISMO: A Scalable Bit-Serial Matrix Multiplication Overlay for Reconfigurable Computing
☆142Updated 5 years ago
GATECH-EIC / AutoDNNchip
☆71Updated 5 years ago
cornell-zhang / dnn-gating
Conditional channel- and precision-pruning on neural networks
☆72Updated 5 years ago
allenbai01 / ProxQuant
ProxQuant: Quantized Neural Networks via Proximal Operators
☆30Updated 6 years ago
anony-sub / chameleon
Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation
☆27Updated 5 years ago
stevenygd / WAGE.pytorch
Reproduction of WAGE in PyTorch.
☆44Updated 6 years ago
Qualcomm-AI-research / oscillations-qat
☆76Updated 3 years ago
stanford-mast / nn_dataflow
Explore the energy-efficient dataflow scheduling for neural networks.
☆229Updated 5 years ago
BradMcDanel / column-combine
☆27Updated 5 years ago
aojunzz / NM-sparsity
☆241Updated 2 years ago