pkuyym / EvolvingAttention
☆14Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for EvolvingAttention
- PyTorch implementation of Pay Attention to MLPs☆39Updated 3 years ago
- [ICLR 2022] "Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to Practice" by Peihao Wang, Wen…☆75Updated 10 months ago
- PyTorch implementation of FNet: Mixing Tokens with Fourier transforms☆25Updated 3 years ago
- [ICLR 2023] “ Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Better Representations”, Ziyu Jian…☆23Updated last year
- [NeurIPS 2022] "Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Spee…☆15Updated last year
- Implementation of: Hydra Attention: Efficient Attention with Many Heads (https://arxiv.org/abs/2209.07484)☆12Updated last year
- ☆15Updated last year
- Refactored version of https://github.com/ming024/FastSpeech2☆13Updated 3 years ago
- Reference implementation of DecDTW in PyTorch (ICLR 2023)☆20Updated last year
- Unofficial PyTorch implementation of the paper "cosFormer: Rethinking Softmax In Attention".☆43Updated 3 years ago
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Updated last year
- Transformer based Self-Attention for Complex Numbers☆11Updated 3 years ago
- [NeurIPS 2023 spotlight] Official implementation of HGRN in our NeurIPS 2023 paper - Hierarchically Gated Recurrent Neural Network for Se…☆61Updated 6 months ago
- LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀☆14Updated 3 years ago
- SRTNet☆24Updated last year
- ☆10Updated 7 months ago
- ☆21Updated last year
- ☆41Updated 3 years ago
- "Learning Loss for Test-Time Augmentation (NeurIPS 2020)"☆9Updated 3 years ago
- ☆33Updated 4 years ago
- Mixture of Attention Heads☆39Updated 2 years ago
- an implementation of FAdam (Fisher Adam) in PyTorch☆33Updated 5 months ago
- Unofficial PyTorch implementation of Google's FNet: Mixing Tokens with Fourier Transforms. With checkpoints.☆67Updated 2 years ago
- Skyformer: Remodel Self-Attention with Gaussian Kernel and Nystr\"om Method (NeurIPS 2021)☆59Updated 2 years ago
- [NeurIPS 2023] Official Implementation: "Consistent Diffusion Models"☆54Updated last year
- Advanced Dropout: A Model-free Methodology for Bayesian Dropout Optimization (IEEE TPAMI 2021)☆17Updated 3 years ago
- Implementation of Insertion-deletion Denoising Diffusion Probabilistic Models☆30Updated 2 years ago
- Local Attention - Flax module for Jax☆20Updated 3 years ago
- The official implementation of the paper "Asymmetric Polynomial Loss for Multi-Label Classification"(ICASSP 2023)☆17Updated last year