Separius / awesome-fast-attentionLinks
list of efficient attention modules
☆1,008Updated 3 years ago
Alternatives and similar repositories for awesome-fast-attention
Users that are interested in awesome-fast-attention are comparing it to the libraries listed below
Sorting:
- Some tricks of pytorch...☆1,191Updated 11 months ago
- Source code for "On the Relationship between Self-Attention and Convolutional Layers"☆1,101Updated 2 years ago
- Debug PyTorch code using PySnooper☆798Updated 4 years ago
- label-smooth, amsoftmax, partial-fc, focal-loss, triplet-loss, lovasz-softmax. Maybe useful☆2,241Updated 7 months ago
- some tircks for PyTorch☆577Updated 5 years ago
- A comprehensive list of awesome contrastive self-supervised learning papers.☆1,275Updated 8 months ago
- Gradually-Warmup Learning Rate Scheduler for PyTorch☆988Updated 7 months ago
- PyTorch implementation of Contrastive Learning methods☆1,984Updated last year
- A quickstart and benchmark for pytorch distributed training.☆1,666Updated 10 months ago
- A PyTorch Implementation of Focal Loss.☆981Updated 5 years ago
- Ranger - a synergistic optimizer using RAdam (Rectified Adam), Gradient Centralization and LookAhead in one codebase☆1,201Updated last year
- [ICLR 2020] Lite Transformer with Long-Short Range Attention☆608Updated 10 months ago
- Pytorch library for fast transformer implementations☆1,709Updated 2 years ago
- This repository contains code for the paper "Decoupling Representation and Classifier for Long-Tailed Recognition", published at ICLR 202…☆970Updated 3 years ago
- knowledge distillation papers☆755Updated 2 years ago
- Model analyzer in PyTorch☆1,481Updated 2 years ago
- Implementation of LambdaNetworks, a new approach to image recognition that reaches SOTA with less compute☆1,531Updated 4 years ago
- Implementing Attention Augmented Convolutional Networks using Pytorch☆653Updated 3 years ago
- A curated list of resources for Learning with Noisy Labels☆2,681Updated last month
- Awesome Knowledge-Distillation. 分类整理的知识蒸馏paper(2014-2021)。☆2,598Updated 2 years ago
- A curated list of Multimodal Related Research.☆1,348Updated last year
- An All-MLP solution for Vision, from Google AI☆1,024Updated 8 months ago
- My best practice of training large dataset using PyTorch.☆1,098Updated last year
- Sublinear memory optimization for deep learning. https://arxiv.org/abs/1604.06174☆598Updated 5 years ago
- [arXiv 2019] "Contrastive Multiview Coding", also contains implementations for MoCo and InstDis☆1,325Updated 4 years ago
- ☆879Updated last year
- PyTorch DataLoaders implemented with DALI for accelerating image preprocessing☆881Updated 4 years ago
- Reformer, the efficient Transformer, in Pytorch☆2,169Updated last year
- Implementation of various self-attention mechanisms focused on computer vision. Ongoing repository.☆1,211Updated 3 years ago
- lookahead optimizer (Lookahead Optimizer: k steps forward, 1 step back) for pytorch☆336Updated 5 years ago