Separius / awesome-fast-attentionLinks
list of efficient attention modules
☆1,012Updated 4 years ago
Alternatives and similar repositories for awesome-fast-attention
Users that are interested in awesome-fast-attention are comparing it to the libraries listed below
Sorting:
- Some tricks of pytorch...☆1,194Updated last year
- Source code for "On the Relationship between Self-Attention and Convolutional Layers"☆1,109Updated 2 years ago
- Gradually-Warmup Learning Rate Scheduler for PyTorch☆992Updated 11 months ago
- label-smooth, amsoftmax, partial-fc, focal-loss, triplet-loss, lovasz-softmax. Maybe useful☆2,249Updated 10 months ago
- A PyTorch Implementation of Focal Loss.☆987Updated 5 years ago
- Debug PyTorch code using PySnooper☆799Updated 4 years ago
- My best practice of training large dataset using PyTorch.☆1,101Updated last year
- PyTorch implementation of Contrastive Learning methods☆1,988Updated last year
- some tircks for PyTorch☆576Updated 5 years ago
- Ranger - a synergistic optimizer using RAdam (Rectified Adam), Gradient Centralization and LookAhead in one codebase☆1,203Updated last year
- A quickstart and benchmark for pytorch distributed training.☆1,666Updated last year
- Awesome Knowledge-Distillation. 分类整理的知识蒸馏paper(2014-2021)。☆2,625Updated 2 years ago
- A comprehensive list of awesome contrastive self-supervised learning papers.☆1,289Updated last year
- An All-MLP solution for Vision, from Google AI☆1,039Updated 2 months ago
- pytorch memory track code☆1,018Updated 4 years ago
- This repository contains code for the paper "Decoupling Representation and Classifier for Long-Tailed Recognition", published at ICLR 202…☆975Updated 3 years ago
- Unofficial implementation of: Multi-task learning using uncertainty to weigh losses for scene geometry and semantics☆556Updated 3 years ago
- Pytorch implementation of the paper "Class-Balanced Loss Based on Effective Number of Samples"☆800Updated last year
- 深度学习近年来关于神经网络模型解释性的相关高引用/顶会论文(附带代码)☆754Updated last year
- knowledge distillation papers☆758Updated 2 years ago
- FSL-Mate: A collection of resources for few-shot learning (FSL).☆1,752Updated last month
- [ICLR 2020] Lite Transformer with Long-Short Range Attention☆611Updated last year
- A multi-task learning example for the paper https://arxiv.org/abs/1705.07115☆866Updated 5 years ago
- ☆882Updated last year
- Collection for Few-shot Learning☆984Updated 2 years ago
- My take on a practical implementation of Linformer for Pytorch.☆419Updated 3 years ago
- DeLighT: Very Deep and Light-Weight Transformers☆469Updated 4 years ago
- [NeurIPS 2020] Semi-Supervision (Unlabeled Data) & Self-Supervision Improve Class-Imbalanced / Long-Tailed Learning☆759Updated 4 years ago
- A curated list of papers, code and resources pertaining to zero shot learning☆930Updated 4 years ago
- A curated list of resources for Learning with Noisy Labels☆2,703Updated 4 months ago