Separius / awesome-fast-attention
list of efficient attention modules
☆995Updated 3 years ago
Alternatives and similar repositories for awesome-fast-attention:
Users that are interested in awesome-fast-attention are comparing it to the libraries listed below
- label-smooth, amsoftmax, partial-fc, focal-loss, triplet-loss, lovasz-softmax. Maybe useful☆2,207Updated 3 months ago
- Some tricks of pytorch...☆1,177Updated 6 months ago
- Source code for "On the Relationship between Self-Attention and Convolutional Layers"☆1,094Updated 2 years ago
- PyTorch implementation of Contrastive Learning methods☆1,952Updated last year
- [ICLR 2020] Lite Transformer with Long-Short Range Attention☆602Updated 6 months ago
- A PyTorch Implementation of Focal Loss.☆971Updated 5 years ago
- This repository contains code for the paper "Decoupling Representation and Classifier for Long-Tailed Recognition", published at ICLR 202…☆954Updated 3 years ago
- DeLighT: Very Deep and Light-Weight Transformers☆468Updated 4 years ago
- some tircks for PyTorch☆579Updated 5 years ago
- [arXiv 2019] "Contrastive Multiview Coding", also contains implementations for MoCo and InstDis☆1,310Updated 4 years ago
- Debug PyTorch code using PySnooper☆801Updated 3 years ago
- A comprehensive list of awesome contrastive self-supervised learning papers.☆1,242Updated 4 months ago
- Gradually-Warmup Learning Rate Scheduler for PyTorch☆982Updated 3 months ago
- Pytorch implementation of the paper "Class-Balanced Loss Based on Effective Number of Samples"☆787Updated 11 months ago
- A quickstart and benchmark for pytorch distributed training.☆1,650Updated 5 months ago
- Implementing Attention Augmented Convolutional Networks using Pytorch☆646Updated 2 years ago
- [NeurIPS 2020] Semi-Supervision (Unlabeled Data) & Self-Supervision Improve Class-Imbalanced / Long-Tailed Learning☆740Updated 3 years ago
- Ranger - a synergistic optimizer using RAdam (Rectified Adam), Gradient Centralization and LookAhead in one codebase☆1,196Updated last year
- [CVPR 2021] Involution: Inverting the Inherence of Convolution for Visual Recognition, a brand new neural operator☆1,309Updated 3 years ago
- ☆870Updated 7 months ago
- Awesome Knowledge-Distillation. 分类整理的知识蒸馏paper(2014-2021)。☆2,539Updated last year
- A curated list of Multimodal Related Research.☆1,331Updated last year
- The official PyTorch implementation of paper BBN: Bilateral-Branch Network with Cumulative Learning for Long-Tailed Visual Recognition☆662Updated 2 years ago
- Reformer, the efficient Transformer, in Pytorch☆2,140Updated last year
- Collection for Few-shot Learning☆972Updated last year
- Official PyTorch Repo for "ReZero is All You Need: Fast Convergence at Large Depth"☆408Updated 5 months ago
- My best practice of training large dataset using PyTorch.☆1,093Updated 8 months ago
- Implementation of LambdaNetworks, a new approach to image recognition that reaches SOTA with less compute☆1,534Updated 4 years ago
- A curated list of resources for Learning with Noisy Labels☆2,648Updated 8 months ago
- torchsummaryX: Improved visualization tool of torchsummary☆301Updated 2 years ago