pkuyym / EvolvingAttention
☆14 · Updated 2 years ago
Related projects
Alternatives and complementary repositories for EvolvingAttention
- PyTorch implementation of Pay Attention to MLPs (see the Spatial Gating Unit sketch after this list) ☆39 · Updated 3 years ago
- [NeurIPS 2023 spotlight] Official implementation of HGRN in our NeurIPS 2023 paper - Hierarchically Gated Recurrent Neural Network for Se… ☆61 · Updated 6 months ago
- Unofficial PyTorch implementation of the paper "cosFormer: Rethinking Softmax in Attention" ☆43 · Updated 3 years ago
- This repository contains the source code for SoftCTC. The original paper can be found at https://arxiv.org/abs/2212.02135 ☆15 · Updated last year
- HGRN2: Gated Linear RNNs with State Expansion ☆49 · Updated 2 months ago
- [NeurIPS 2022] Your Transformer May Not be as Powerful as You Expect (official implementation) ☆33 · Updated last year
- Skyformer: Remodel Self-Attention with Gaussian Kernel and Nyström Method (NeurIPS 2021) ☆59 · Updated 2 years ago
- Why Do We Need Weight Decay in Modern Deep Learning? [NeurIPS 2024] ☆52 · Updated last month
- TriNet: stabilizing self-supervised learning against complete or slow collapse in ASR ☆26 · Updated last year
- A variant of Transformer-XL where the memory is updated not with a queue, but with attention ☆46 · Updated 4 years ago
- Implementation of Insertion-deletion Denoising Diffusion Probabilistic Models ☆30 · Updated 2 years ago
- Mixture of Attention Heads ☆39 · Updated 2 years ago
- Relative Positional Encoding for Transformers with Linear Complexity ☆61 · Updated 2 years ago
- Sparse Attention with Linear Units ☆17 · Updated 3 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z… ☆30 · Updated 2 years ago
- A repository for DenseSSMs ☆88 · Updated 7 months ago
- PyTorch implementation of the paper "Learning to (Learn at Test Time): RNNs with Expressive Hidden States" ☆23 · Updated last week
- Reference implementation of DecDTW in PyTorch (ICLR 2023) ☆20 · Updated last year
- [NeurIPS 2022] "Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Spee…☆15Updated last year
- Implementation of Hydra Attention: Efficient Attention with Many Heads (https://arxiv.org/abs/2209.07484; see the sketch after this list) ☆11 · Updated last year
- [ICLR 2023] "Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers" by Tianlong Chen*, Zhenyu Zhang*, Ajay Jaiswal…☆44Updated last year
- Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012 (see the sketch after this list) ☆49 · Updated 2 years ago
- Several types of attention modules written in PyTorch ☆40 · Updated last month
- A replication of the paper "Adaptive Mixtures of Local Experts" applied to the CIFAR-10 image classification dataset ☆9 · Updated 3 years ago
- ResiDual: Transformer with Dual Residual Connections (https://arxiv.org/abs/2304.14802) ☆87 · Updated last year
- PyTorch implementation of FNet: Mixing Tokens with Fourier Transforms (see the sketch after this list) ☆25 · Updated 3 years ago
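For the Pay Attention to MLPs entry above, the core novelty is the Spatial Gating Unit. A minimal PyTorch sketch of that unit follows; the class name and tensor shapes are my own choices, and the near-identity initialization follows the paper's description:

```python
import torch
import torch.nn as nn

class SpatialGatingUnit(nn.Module):
    """Spatial Gating Unit from "Pay Attention to MLPs" (gMLP).

    Splits the channels in half, mixes one half along the sequence
    axis with a learned linear map, and uses it to gate the other.
    """
    def __init__(self, dim: int, seq_len: int):
        super().__init__()
        self.norm = nn.LayerNorm(dim // 2)
        # Linear projection over the *sequence* axis (token mixing).
        self.spatial_proj = nn.Linear(seq_len, seq_len)
        # Per the paper: weights near zero, bias at one, so the unit
        # starts out close to identity gating.
        nn.init.zeros_(self.spatial_proj.weight)
        nn.init.ones_(self.spatial_proj.bias)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        u, v = x.chunk(2, dim=-1)                      # (B, N, d/2) each
        v = self.norm(v)
        v = self.spatial_proj(v.transpose(1, 2)).transpose(1, 2)
        return u * v                                   # element-wise gate
```

For `x = torch.randn(2, 64, 256)`, `SpatialGatingUnit(256, 64)(x)` has shape `(2, 64, 128)`: the gate halves the channel count, which the surrounding gMLP block restores with its output projection.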
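The Hydra Attention entry reduces to a surprisingly small computation: with as many heads as feature dimensions and a cosine-similarity kernel, multi-head attention collapses to a global gating that is linear in the number of tokens. A sketch under those assumptions (the function name is mine):

```python
import torch
import torch.nn.functional as F

def hydra_attention(q: torch.Tensor, k: torch.Tensor, v: torch.Tensor) -> torch.Tensor:
    """Hydra attention (arXiv:2209.07484) with heads == feature dims.

    out_t = phi(q_t) * sum_s phi(k_s) * v_s, with phi = L2 normalization,
    costing O(N*d) instead of softmax attention's O(N^2*d).
    q, k, v: (batch, tokens, dim)
    """
    q = F.normalize(q, dim=-1)                  # phi(q)
    k = F.normalize(k, dim=-1)                  # phi(k)
    kv = (k * v).sum(dim=1, keepdim=True)       # sum_s phi(k_s) * v_s -> (B, 1, d)
    return q * kv                               # broadcast gate over all tokens
```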
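The ReLA repository swaps softmax for ReLU on the scaled attention scores, which leaves the attention weights unnormalized and naturally sparse; the paper stabilizes training by normalizing the attention output. A minimal single-head sketch, with the paper's gating and multi-head plumbing omitted (the class name is mine):

```python
import torch
import torch.nn as nn

class RectifiedLinearAttention(nn.Module):
    """Core of ReLA from "Sparse Attention with Linear Units"."""
    def __init__(self, dim: int, eps: float = 1e-6):
        super().__init__()
        self.scale = dim ** -0.5
        self.eps = eps
        self.gain = nn.Parameter(torch.ones(dim))  # learned RMSNorm gain

    def forward(self, q, k, v):
        # ReLU instead of softmax: sparse, unnormalized attention weights.
        scores = torch.relu(q @ k.transpose(-2, -1) * self.scale)
        out = scores @ v
        # RMS-style normalization over features keeps magnitudes bounded.
        rms = out.pow(2).mean(dim=-1, keepdim=True).add(self.eps).rsqrt()
        return out * rms * self.gain
```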
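FNet's mixing sublayer is parameter-free and fits in one line: a DFT along the hidden dimension, a DFT along the sequence dimension, keep the real part. A sketch of that sublayer alone (the function name is mine; the full model wraps it in residual and feed-forward blocks):

```python
import torch

def fnet_mixing(x: torch.Tensor) -> torch.Tensor:
    """FNet token mixing: a parameter-free stand-in for self-attention.

    x: (batch, seq_len, hidden)
    """
    # 1D FFT over the hidden dims, then over the sequence; real part only.
    return torch.fft.fft(torch.fft.fft(x, dim=-1), dim=-2).real
```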