adelmanm / approx
Code for the paper "Faster Neural Network Training with Approximate Tensor Operations"
☆10Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for approx
- Code for "Training Neural Networks with Fixed Sparse Masks" (NeurIPS 2021).☆56Updated 2 years ago
- Code release to reproduce ASHA experiments from "Random Search and Reproducibility for NAS."☆22Updated 5 years ago
- An Attention Superoptimizer☆20Updated 6 months ago
- Codes of the paper Deformable Butterfly: A Highly Structured and Sparse Linear Transform.☆12Updated 3 years ago
- AN EFFICIENT AND GENERAL FRAMEWORK FOR LAYERWISE-ADAPTIVE GRADIENT COMPRESSION☆13Updated last year
- Pytorch library for factorized L0-based pruning.☆43Updated last year
- Code accompanying the NeurIPS 2020 paper: WoodFisher (Singh & Alistarh, 2020)☆46Updated 3 years ago
- ☆10Updated 2 years ago
- Block Sparse movement pruning☆78Updated 3 years ago
- "Moshpit SGD: Communication-Efficient Decentralized Training on Heterogeneous Unreliable Devices", official implementation☆28Updated last year
- NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference☆61Updated last month
- [NeurIPS 2022] DataMUX: Data Multiplexing for Neural Networks☆59Updated last year
- PyTorch implementation of HashedNets☆36Updated last year
- ☆17Updated 4 years ago
- This package implements THOR: Transformer with Stochastic Experts.☆61Updated 3 years ago
- Learning Accurate Decision Trees with Bandit Feedback via Quantized Gradient Descent☆14Updated 2 years ago
- ☆32Updated 3 years ago
- Distributed K-FAC Preconditioner for PyTorch☆80Updated this week
- Factorized Neural Layers☆27Updated last year
- Implementation for ACProp ( Momentum centering and asynchronous update for adaptive gradient methdos, NeurIPS 2021)☆15Updated 3 years ago
- An adaptive training algorithm for residual network☆14Updated 4 years ago
- CBench, Benchmarking System for Question Answering Over Knowledge Graphs Systems.☆10Updated 2 years ago
- ☆17Updated last year
- Differentiable Product Quantization for End-to-End Embedding Compression.☆58Updated last year
- PyTorch implementation of Proximal Gradient Algorithms a la Parikh and Boyd (2014). Useful for Auto-Sizing (Murray and Chiang 2015, Murra…☆40Updated 4 years ago
- Code for BlockSwap (ICLR 2020).☆33Updated 3 years ago
- [JMLR'20] NeurIPS 2019 MicroNet Challenge Efficient Language Modeling, Champion☆40Updated 3 years ago
- [ACL-IJCNLP 2021] "EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets" by Xiaohan Chen, Yu Cheng, Shuohang Wang, Zhe Gan, …☆17Updated 2 years ago
- ☆14Updated 2 years ago
- ☆43Updated 4 years ago