pashu123 / Transformers
PyTorch Implementation of Transformers Explained with Comments
☆15Updated 5 years ago
Alternatives and similar repositories for Transformers
Users that are interested in Transformers are comparing it to the libraries listed below
- Official PyTorch implementation of LilNetX: Lightweight Networks with EXtreme Model Compression and Structured Sparsification☆46Updated 3 years ago
- Artifact for IPDPS'21: DSXplore: Optimizing Convolutional Neural Networks via Sliding-Channel Convolutions.☆13Updated 4 years ago
- An implementation of run-length encoding for PyTorch tensors using CUDA☆14Updated 4 years ago
- [ECCV 2022] SuperTickets: Drawing Task-Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning☆20Updated 2 years ago
- RevBiFPN: The Fully Reversible Bidirectional Feature Pyramid Network☆14Updated 2 years ago
- PyTorch implementation of "MLP-Mixer: An all-MLP Architecture for Vision" Tolstikhin et al. (2021)☆30Updated 4 years ago
- A highly modular PyTorch framework with a focus on Neural Architecture Search (NAS).☆23Updated 3 years ago
- Transformers w/o Attention, based fully on MLPs☆93Updated last year
- You Only Search Once: On Lightweight Differentiable Architecture Search for Resource-Constrained Embedded Platforms☆11Updated 2 years ago
- Algorithm-hardware Co-design for Deformable Convolution☆24Updated 4 years ago
- Official implementation of "UNAS: Differentiable Architecture Search Meets Reinforcement Learning", CVPR 2020 Oral☆61Updated last year
- Nested Hierarchical Transformer https://arxiv.org/pdf/2105.12723.pdf☆197Updated 10 months ago
- A simple program to calculate and visualize the FLOPs and Parameters of PyTorch models, with handy CLI and easy-to-use Python API.☆129Updated 7 months ago
- [ICML 2022] "DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks", by Yonggan …☆71Updated 2 years ago
- A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training☆20Updated 2 years ago
- ☆50Updated 2 years ago
- A better PyTorch implementation of image local attention which reduces the GPU memory by an order of magnitude.☆141Updated 3 years ago
- MONeT framework for reducing memory consumption of DNN training☆173Updated 4 years ago
- [NeurIPS 2020] ShiftAddNet: A Hardware-Inspired Deep Network☆73Updated 4 years ago
- Arch-Net: Model Distillation for Architecture Agnostic Model Deployment☆23Updated 3 years ago
- ☆11Updated 3 years ago
- Overview of conditional computation and dynamic CNNs for computer vision, with a focus on reducing computational complexity☆42Updated 2 years ago
- Official PyTorch code for "APP: Anytime Progressive Pruning" (DyNN @ ICML, 2022; CLL @ ACML, 2022, SNN @ ICML, 2022 and SlowDNN 2023)☆16Updated 2 years ago
- ☆32Updated 2 weeks ago
- All about acceleration and compression of Deep Neural Networks☆33Updated 5 years ago
- ☆43Updated last year
- Implementation of "compositional attention" from MILA, a multi-head attention variant that is reframed as a two-step attention process wi…☆51Updated 3 years ago
- PyTorch implementation of Lambda Network and pretrained Lambda-ResNet☆54Updated 4 years ago
- Fast NPU-aware Neural Architecture Search☆22Updated 3 years ago
- Fairring (FAIR + Herring) is a plug-in for PyTorch that provides a process group for distributed training that outperforms NCCL at large …☆65Updated 3 years ago