thuml / FlowformerLinks
About Code release for "Flowformer: Linearizing Transformers with Conservation Flows" (ICML 2022), https://arxiv.org/pdf/2202.06258.pdf
☆331Updated last year
Alternatives and similar repositories for Flowformer
Users that are interested in Flowformer are comparing it to the libraries listed below
Sorting:
- [CVPR 2022 Oral] Balanced MSE for Imbalanced Visual Regression https://arxiv.org/abs/2203.16427☆392Updated 3 years ago
- [ICLR 2022] Official implementation of cosformer-attention in cosFormer: Rethinking Softmax in Attention☆197Updated 3 years ago
- Official PyTorch implementation for the paper "CARD: Classification and Regression Diffusion Models"☆234Updated 2 years ago
- Pytorch implementation of the GradNorm. GradNorm addresses the problem of balancing multiple losses for multi-task learning by learning a…☆272Updated 3 years ago
- Source code of ICML'22 paper: FEDformer: Frequency Enhanced Decomposed Transformer for Long-term Series Forecasting☆218Updated 3 years ago
- Code repository of the paper "Modelling Long Range Dependencies in ND: From Task-Specific to a General Purpose CNN" https://arxiv.org/abs…☆183Updated 8 months ago
- Implementation of the Transformer variant proposed in "Transformer Quality in Linear Time"☆372Updated 2 years ago
- The official code for the paper: https://openreview.net/forum?id=_PHymLIxuI☆400Updated 2 years ago
- PyTorch implementation of the GradNorm☆117Updated last year
- Official repository for CVPR21 paper "Deep Stable Learning for Out-Of-Distribution Generalization".☆200Updated 3 years ago
- CVPR2022, BatchFormer: Learning to Explore Sample Relationships for Robust Representation Learning, https://arxiv.org/abs/2203.01522☆254Updated 2 years ago
- Multi-head attention in PyTorch☆156Updated 6 years ago
- Implementation of the paper NAST: Non-Autoregressive Spatial-Temporal Transformer for Time Series Forecasting.☆77Updated 4 years ago
- ☆300Updated 3 years ago
- Pytorch implementation for t-SNE with cuda to accelerate☆340Updated 2 years ago
- Source code of NeurIPS'22 paper: FiLM: Frequency improved Legendre Memory Model for Long-term Time Series Forecasting☆33Updated 3 years ago
- PyTorch implementation of Representation Learning with Contrastive Predictive Coding by Van den Oord et al. (2018)☆89Updated 4 years ago
- Multi-task learning using uncertainty to weigh losses for scene geometry and semantics, Auxiliary Tasks in Multi-task Learning☆642Updated 5 years ago
- An official codebase of paper "Revisiting Sparse Convolutional Model for Visual Recognition"☆125Updated 2 years ago
- kmeans using PyTorch☆529Updated 2 years ago
- This is an official implementation code for paper "A Survey on Time-Series Pre-Trained Models" (TKDE-24).☆213Updated last year
- [NeurIPS 2021] [T-PAMI] Global Filter Networks for Image Classification☆498Updated 2 years ago
- An implementation of the efficient attention module.☆327Updated 5 years ago
- Code release for "LogME: Practical Assessment of Pre-trained Models for Transfer Learning" (ICML 2021) and Ranking and Tuning Pre-trained…☆211Updated 2 years ago
- A simple cross attention that updates both the source and target in one step☆194Updated 5 months ago
- Unofficial implementation of MLP-Mixer: An all-MLP Architecture for Vision☆217Updated 4 years ago
- ☆137Updated 2 years ago
- PyTorch implementation of the InfoNCE loss for self-supervised learning.☆608Updated 2 years ago
- Official implementation of the paper "FilterNet: Harnessing Frequency Filters for Time Series Forecasting"☆217Updated 11 months ago
- Implementation of Linformer for Pytorch☆306Updated 2 years ago