mit-han-lab / neurips-micronetLinks

[JMLR'20] NeurIPS 2019 MicroNet Challenge Efficient Language Modeling, Champion

☆41

Alternatives and similar repositories for neurips-micronet

Users that are interested in neurips-micronet are comparing it to the libraries listed below

Sorting:

lottery-ticket / rewinding-iclr20-public
☆69Updated 5 years ago
stevenygd / SWALP
Code for paper "SWALP: Stochastic Weight Averaging forLow-Precision Training".
☆62Updated 6 years ago
jfainberg / hashed_nets
PyTorch implementation of HashedNets
☆37Updated 2 years ago
HazyResearch / butterfly
Butterfly matrix multiplication in PyTorch
☆175Updated 2 years ago
TezRomacH / layer-to-layer-pytorch
PyTorch implementation of L2L execution algorithm
☆108Updated 2 years ago
princeton-nlp / DataMUX
[NeurIPS 2022] DataMUX: Data Multiplexing for Neural Networks
☆60Updated 2 years ago
huggingface / block_movement_pruning
Block Sparse movement pruning
☆81Updated 4 years ago
ptillet / torch-blocksparse
Block-sparse primitives for PyTorch
☆160Updated 4 years ago
liamcli / darts_asha
Code release to reproduce ASHA experiments from "Random Search and Reproducibility for NAS."
☆22Updated 6 years ago
allenai / dnw
Discovering Neural Wirings (https://arxiv.org/abs/1906.00586)
☆136Updated 5 years ago
uber-research / permute-quantize-finetune
Using ideas from product quantization for state-of-the-art neural network compression.
☆146Updated 4 years ago
NVlabs / unas
Official implementation of "UNAS: Differentiable Architecture Search Meets Reinforcement Learning", CVPR 2020 Oral
☆61Updated 2 years ago
mit-han-lab / hardware-aware-transformers
[ACL'20] HAT: Hardware-Aware Transformers for Efficient Natural Language Processing
☆336Updated last year
ischlag / fast-weight-transformers
Official code repository of the paper Linear Transformers Are Secretly Fast Weight Programmers.
☆108Updated 4 years ago
GATECH-EIC / Early-Bird-Tickets
[ICLR 2020] Drawing Early-Bird Tickets: Toward More Efficient Training of Deep Networks
☆140Updated 5 years ago
HayeonLee / MetaD2A
Official PyTorch implementation of "Rapid Neural Architecture Search by Learning to Generate Graphs from Datasets" (ICLR 2021)
☆64Updated last year
evcu / pytorchpruner
☆22Updated 7 years ago
sIncerass / powernorm
[ICML 2020] code for "PowerNorm: Rethinking Batch Normalization in Transformers" https://arxiv.org/abs/2003.07845
☆120Updated 4 years ago
facebookresearch / AlphaNet
AlphaNet Improved Training of Supernet with Alpha-Divergence
☆100Updated 4 years ago
lucidrains / mlp-gpt-jax
A GPT, made only of MLPs, in Jax
☆58Updated 4 years ago
jianweif / OptimalGradCheckpointing
☆41Updated 4 years ago
teddykoker / performer
Simply Numpy implementation of the FAVOR+ attention mechanism, https://teddykoker.com/2020/11/performers/
☆38Updated 4 years ago
XinDongol / DNNAC
All about acceleration and compression of Deep Neural Networks
☆33Updated 6 years ago
HazyResearch / fly
☆220Updated 2 years ago
DeMoriarty / custom_matmul_kernels
Customized matrix multiplication kernels
☆57Updated 3 years ago
xiaomi-automl / MixPath
MixPath: A Unified Approach for One-shot Neural Architecture Search
☆29Updated 5 years ago
asappresearch / flop
Pytorch library for factorized L0-based pruning.
☆45Updated 2 years ago
chrundle / biprop
Identify a binary weight or binary weight and activation subnetwork within a randomly initialized network by only pruning and binarizing …
☆51Updated 3 years ago
cybertronai / imagenet18
Train ImageNet in 18 minutes on AWS
☆133Updated last year
facebookresearch / diffq
DiffQ performs differentiable quantization using pseudo quantization noise. It can automatically tune the number of bits used per weight …
☆236Updated 2 years ago