pashu123 / Transformers
PyTorch Implementation of Transformers Explained with Comments
☆15Updated 5 years ago
Alternatives and similar repositories for Transformers
Users who are interested in Transformers are comparing it to the libraries listed below
- RevBiFPN: The Fully Reversible Bidirectional Feature Pyramid Network☆14Updated 2 years ago
- An implementation of run-length encoding for PyTorch tensors using CUDA☆14Updated 4 years ago
- Seamless analysis of your PyTorch models (RAM usage, FLOPs, MACs, receptive field, etc.)☆221Updated 5 months ago
- A highly modular PyTorch framework with a focus on Neural Architecture Search (NAS).☆23Updated 3 years ago
- MONeT framework for reducing memory consumption of DNN training☆174Updated 4 years ago
- ☆17Updated 5 years ago
- A NumPy implementation of a Convolutional Neural Network: slow & fast (im2col/col2im).☆58Updated 2 years ago
- Easily benchmark PyTorch model FLOPs, latency, throughput, allocated GPU memory and energy consumption☆107Updated 2 years ago
- A collection of metrics to profile a single deep learning model or compare two different deep learning models☆27Updated last year
- Estimate/count FLOPs for a given neural network using PyTorch☆306Updated 3 years ago
- PyTorch implementation of "MLP-Mixer: An all-MLP Architecture for Vision" Tolstikhin et al. (2021)☆31Updated 4 years ago
- Using ideas from product quantization for state-of-the-art neural network compression.☆146Updated 4 years ago
- Binarize convolutional neural networks using PyTorch☆147Updated 3 years ago
- A research library for PyTorch-based neural network pruning, compression, and more.☆162Updated 2 years ago
- Arch-Net: Model Distillation for Architecture Agnostic Model Deployment☆24Updated 3 years ago
- Official PyTorch implementation of LilNetX: Lightweight Networks with EXtreme Model Compression and Structured Sparsification☆47Updated 3 years ago
- Overview of conditional computation and dynamic CNNs for computer vision, with a focus on reducing computational complexity☆42Updated 3 years ago
- You Only Search Once: On Lightweight Differentiable Architecture Search for Resource-Constrained Embedded Platforms☆11Updated 2 years ago
- Official implementation of "UNAS: Differentiable Architecture Search Meets Reinforcement Learning", CVPR 2020 Oral☆61Updated last year
- PyTorch ImageNet training code with various tricks, LR schedulers, distributed training, mixed-precision training, DALI dataloader, etc.☆18Updated 5 years ago
- Customized matrix multiplication kernels☆56Updated 3 years ago
- ☆57Updated 3 years ago
- My PhD manuscript LaTeX code and the slides for the defense☆10Updated 3 years ago
- Code repo for the paper "BiT: Robustly Binarized Multi-distilled Transformer"☆111Updated 2 years ago
- Automatic learning-rate scheduler☆46Updated 4 years ago
- Network Pruning That Matters: A Case Study on Retraining Variants (ICLR 2021)☆17Updated 4 years ago
- Algorithm-hardware Co-design for Deformable Convolution☆24Updated 4 years ago
- A better PyTorch implementation of image local attention which reduces the GPU memory by an order of magnitude.☆141Updated 3 years ago
- TF2 implementation of knowledge distillation using the "function matching" hypothesis from https://arxiv.org/abs/2106.05237.☆88Updated 3 years ago
- Pre-trained NFNets with 99% of the accuracy of the official paper "High-Performance Large-Scale Image Recognition Without Normalization".☆160Updated 4 years ago