pashu123 / TransformersLinks

Pytorch Implementation of Transformers Explained with Comments

☆15

Alternatives and similar repositories for Transformers

Users that are interested in Transformers are comparing it to the libraries listed below

Sorting:

lehduong / NPTM
Network Pruning That Matters: A Case Study on Retraining Variants (ICLR 2021)
☆17Updated 3 years ago
GATECH-EIC / SuperTickets
[ECCV 2022] SuperTickets: Drawing Task-Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning
☆21Updated 3 years ago
lecoan / pytorch-RLE
A implement of run-length encoding for Pytorch tensor using CUDA
☆14Updated 4 years ago
zheng-ningxin / Pruning-from-scratch
☆17Updated 5 years ago
thomasverelst / awesome-dynamic-conditional-networks-cv
Overview of conditional computution and dynamic CNNs for computer vision, with a focus on reducing computational complexity
☆42Updated 3 years ago
google-research / nested-transformer
Nested Hierarchical Transformer https://arxiv.org/pdf/2105.12723.pdf
☆198Updated 11 months ago
Deeplite / deeplite-profiler
A collection of metrics to profile a single deep learning model or compare two different deep learning models
☆26Updated last year
cogsys-tuebingen / uninas
A highly modular PyTorch framework with a focus on Neural Architecture Search (NAS).
☆23Updated 3 years ago
uber-research / permute-quantize-finetune
Using ideas from product quantization for state-of-the-art neural network compression.
☆145Updated 3 years ago
CerebrasResearch / RevBiFPN
RevBiFPN: The Fully Reversible Bidirectional Feature Pyramid Network
☆14Updated 2 years ago
isaaccorley / mlp-mixer-pytorch
PyTorch implementation of "MLP-Mixer: An all-MLP Architecture for Vision" Tolstikhin et al. (2021)
☆31Updated 4 years ago
frgfm / torch-scan
Seamless analysis of your PyTorch models (RAM usage, FLOPs, MACs, receptive field, etc.)
☆218Updated 4 months ago
DequanWang / CoDeNet
Algorithm-hardware Co-design for Deformable Convolution
☆24Updated 4 years ago
utsaslab / MONeT
MONeT framework for reducing memory consumption of DNN training
☆173Updated 4 years ago
lmbxmu / RBNN
Pytorch implementation of our paper accepted by NeurIPS 2020 -- Rotated Binary Neural Network
☆82Updated 2 years ago
Sharath-girish / LilNetX
Official PyTorch implementation of LilNetX: Lightweight Networks with EXtreme Model Compression and Structured Sparsification
☆46Updated 3 years ago
ashaw596 / squeezenas
☆69Updated 5 years ago
fawazsammani / awesome-mlp-mixer
Transformers w/o Attention, based fully on MLPs
☆93Updated last year
IST-DASLab / spdy
Code for ICML 2022 paper "SPDY: Accurate Pruning with Speedup Guarantees"
☆20Updated 2 years ago
NVlabs / unas
Official implementation of "UNAS: Differentiable Architecture Search Meets Reinforcement Learning", CVPR 2020 Oral
☆61Updated last year
lucaslie / torchprune
A research library for pytorch-based neural network pruning, compression, and more.
☆162Updated 2 years ago
JiaxiongQ / SlimConv
Reducing Channel Redundancy in Convolutional Neural Networks by Features Recombining (TIP 2021)
☆18Updated 2 years ago
3outeille / CNNumpy
A Numpy implementation of a Convolutional Neural Network: slow & fast (im2col/col2im).
☆58Updated 2 years ago
ThomasRobertFr / thesis
My PhD manuscript LaTeX code and the slides for the defense
☆10Updated 3 years ago
lehduong / kesi
Knowledge distillation from Ensembles of Iterative pruning (BMVC 2020)
☆26Updated 4 years ago
vra / flopth
A simple program to calculate and visualize the FLOPs and Parameters of Pytorch models, with handy CLI and easy-to-use Python API.
☆129Updated 8 months ago
XinDongol / DNNAC
All about acceleration and compression of Deep Neural Networks
☆33Updated 5 years ago
GATECH-EIC / ShiftAddNet
[NeurIPS 2020] ShiftAddNet: A Hardware-Inspired Deep Network
☆73Updated 4 years ago
FrancescoSaverioZuppichini / non-max-suppression-in-pytorch
How to implement Non Max Suppression (NMS) in PyTorch
☆17Updated 2 years ago
YukeWang96 / DSXplore_IPDPS21
Artifact for IPDPS'21: DSXplore: Optimizing Convolutional Neural Networks via Sliding-Channel Convolutions.
☆13Updated 4 years ago