Separius / awesome-fast-attention
list of efficient attention modules
☆989Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for awesome-fast-attention
- Some tricks of pytorch...☆1,160Updated 5 months ago
- [ICLR 2020] Lite Transformer with Long-Short Range Attention☆598Updated 4 months ago
- Source code for "On the Relationship between Self-Attention and Convolutional Layers"☆1,085Updated last year
- DeLighT: Very Deep and Light-Weight Transformers☆467Updated 4 years ago
- Awesome Knowledge-Distillation. 分类整理的知识蒸馏paper(2014-2021)。☆2,497Updated last year
- Debug PyTorch code using PySnooper☆802Updated 3 years ago
- PyTorch implementation of Contrastive Learning methods☆1,942Updated last year
- This repository contains code for the paper "Decoupling Representation and Classifier for Long-Tailed Recognition", published at ICLR 202…☆946Updated 3 years ago
- ☆870Updated 5 months ago
- A comprehensive list of awesome contrastive self-supervised learning papers.☆1,227Updated 2 months ago
- some tircks for PyTorch☆578Updated 4 years ago
- An All-MLP solution for Vision, from Google AI☆1,003Updated 2 months ago
- Gradually-Warmup Learning Rate Scheduler for PyTorch☆977Updated last month
- A quickstart and benchmark for pytorch distributed training.☆1,640Updated 3 months ago
- My take on a practical implementation of Linformer for Pytorch.☆407Updated 2 years ago
- [arXiv 2019] "Contrastive Multiview Coding", also contains implementations for MoCo and InstDis☆1,303Updated 4 years ago
- My best practice of training large dataset using PyTorch.☆1,086Updated 6 months ago
- Implementation of LambdaNetworks, a new approach to image recognition that reaches SOTA with less compute☆1,532Updated 4 years ago
- knowledge distillation papers☆741Updated last year
- Ranger - a synergistic optimizer using RAdam (Rectified Adam), Gradient Centralization and LookAhead in one codebase☆1,194Updated 10 months ago
- Deep Learning Experiment Management☆639Updated last year
- torchsummaryX: Improved visualization tool of torchsummary☆301Updated 2 years ago
- A curated list of Multimodal Related Research.☆1,315Updated last year
- Official PyTorch Repo for "ReZero is All You Need: Fast Convergence at Large Depth"☆407Updated 3 months ago
- [NeurIPS 2020] Semi-Supervision (Unlabeled Data) & Self-Supervision Improve Class-Imbalanced / Long-Tailed Learning☆737Updated 3 years ago
- [ICLR 2020] Contrastive Representation Distillation (CRD), and benchmark of recent knowledge distillation methods☆2,197Updated last year
- Collection for Few-shot Learning☆973Updated last year
- Pytorch implementation of the paper "Class-Balanced Loss Based on Effective Number of Samples"☆785Updated 9 months ago
- Pytorch library for fast transformer implementations☆1,643Updated last year
- pytorch memory track code☆1,003Updated 3 years ago