pashu123 / Transformers
PyTorch Implementation of Transformers Explained with Comments
☆15 · Updated 5 years ago
Alternatives and similar repositories for Transformers
Users who are interested in Transformers are comparing it to the repositories listed below.
- A highly modular PyTorch framework with a focus on Neural Architecture Search (NAS). ☆23 · Updated 3 years ago
- RevBiFPN: The Fully Reversible Bidirectional Feature Pyramid Network ☆15 · Updated 3 years ago
- An implementation of run-length encoding for PyTorch tensors using CUDA ☆14 · Updated 4 years ago
- MONeT framework for reducing memory consumption of DNN training ☆174 · Updated 4 years ago
- [ECCV 2022] SuperTickets: Drawing Task-Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning ☆20 · Updated 3 years ago
- A NumPy implementation of a Convolutional Neural Network: slow & fast (im2col/col2im). ☆58 · Updated 2 years ago
- A research library for PyTorch-based neural network pruning, compression, and more. ☆163 · Updated 3 years ago
- You Only Search Once: On Lightweight Differentiable Architecture Search for Resource-Constrained Embedded Platforms ☆11 · Updated 2 years ago
- A collection of metrics to profile a single deep learning model or compare two different deep learning models ☆27 · Updated 2 years ago
- Easily benchmark PyTorch model FLOPs, latency, throughput, allocated GPU memory and energy consumption ☆109 · Updated 2 years ago
- All about acceleration and compression of Deep Neural Networks ☆33 · Updated 6 years ago
- ☆57 · Updated 3 years ago
- Nested Hierarchical Transformer https://arxiv.org/pdf/2105.12723.pdf ☆198 · Updated last year
- Official PyTorch Implementation of HELP: Hardware-adaptive Efficient Latency Prediction for NAS via Meta-Learning (NeurIPS 2021 Spotlight… ☆63 · Updated last year
- Estimate/count FLOPS for a given neural network using PyTorch ☆306 · Updated 3 years ago
- ☆17 · Updated 5 years ago
- ☆10 · Updated 3 years ago
- Train neural networks with joint quantization and pruning on both weights and activations using any PyTorch modules ☆43 · Updated 3 years ago
- Code repo for the paper BiT: Robustly Binarized Multi-distilled Transformer ☆113 · Updated 2 years ago
- DeltaCNN: End-to-End CNN Inference of Sparse Frame Differences in Videos ☆59 · Updated 2 years ago
- A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training ☆22 · Updated 3 years ago
- Seamless analysis of your PyTorch models (RAM usage, FLOPs, MACs, receptive field, etc.) ☆222 · Updated 8 months ago
- PyTorch implementation of "MLP-Mixer: An all-MLP Architecture for Vision" Tolstikhin et al. (2021) ☆31 · Updated 4 years ago
- ☆52 · Updated 2 years ago
- Courbariaux, Matthieu, Yoshua Bengio, and Jean-Pierre David. "Binaryconnect: Training deep neural networks with binary weights during pro… ☆12 · Updated 5 years ago
- Official implementation of "UNAS: Differentiable Architecture Search Meets Reinforcement Learning", CVPR 2020 Oral ☆62 · Updated 2 years ago
- Network Pruning That Matters: A Case Study on Retraining Variants (ICLR 2021) ☆17 · Updated 4 years ago
- Binarize convolutional neural networks using PyTorch ☆149 · Updated 3 years ago
- How to implement Non Max Suppression (NMS) in PyTorch ☆17 · Updated 3 years ago
- Arch-Net: Model Distillation for Architecture Agnostic Model Deployment ☆23 · Updated 4 years ago