thuml / Flowformer
Code release for "Flowformer: Linearizing Transformers with Conservation Flows" (ICML 2022), https://arxiv.org/pdf/2202.06258.pdf
☆315 Updated 8 months ago
Alternatives and similar repositories for Flowformer:
Users interested in Flowformer are comparing it to the libraries listed below:
- [ICLR 2022] Official implementation of cosformer-attention in cosFormer: Rethinking Softmax in Attention ☆188 Updated 2 years ago
- The official code for the paper: https://openreview.net/forum?id=_PHymLIxuI ☆380 Updated last year
- CVPR2022, BatchFormer: Learning to Explore Sample Relationships for Robust Representation Learning, https://arxiv.org/abs/2203.01522 ☆250 Updated last year
- Official PyTorch implementation for the paper "CARD: Classification and Regression Diffusion Models" ☆222 Updated 2 years ago
- Source code of ICML'22 paper: FEDformer: Frequency Enhanced Decomposed Transformer for Long-term Series Forecasting ☆206 Updated 2 years ago
- [CVPR 2022 Oral] Balanced MSE for Imbalanced Visual Regression https://arxiv.org/abs/2203.16427 ☆381 Updated 2 years ago
- Implementation of the Transformer variant proposed in "Transformer Quality in Linear Time" ☆360 Updated last year
- [NeurIPS 2021] [T-PAMI] Global Filter Networks for Image Classification ☆471 Updated last year
- Transformer based on a variant of attention that has linear complexity with respect to sequence length ☆751 Updated 10 months ago
- Code repository of the paper "Modelling Long Range Dependencies in ND: From Task-Specific to a General Purpose CNN" https://arxiv.org/abs… ☆183 Updated 2 years ago
- Code release for "LogME: Practical Assessment of Pre-trained Models for Transfer Learning" (ICML 2021) and Ranking and Tuning Pre-trained… ☆207 Updated last year
- ☆277 Updated 2 years ago
- PyTorch implementation of t-SNE with CUDA acceleration ☆331 Updated last year
- Official repository for CVPR21 paper "Deep Stable Learning for Out-Of-Distribution Generalization". ☆189 Updated 3 years ago
- A simple cross attention that updates both the source and target in one step ☆164 Updated 10 months ago
- Self-supervised contrastive learning for time series via time-frequency consistency ☆466 Updated 10 months ago
- ☆91 Updated 4 months ago
- Official implementation of the paper "FilterNet: Harnessing Frequency Filters for Time Series Forecasting" ☆170 Updated last month
- Code release for "SimMTM: A Simple Pre-Training Framework for Masked Time-Series Modeling" (NeurIPS 2023 Spotlight), https://arxiv.… ☆124 Updated 10 months ago
- PyTorch implementation of GradNorm. GradNorm addresses the problem of balancing multiple losses for multi-task learning by learning a… ☆259 Updated 2 years ago
- Multi-head attention in PyTorch ☆150 Updated 6 years ago
- [ICLR 2021 top 3%] Is Attention Better Than Matrix Decomposition? ☆331 Updated 2 years ago
- Simba ☆203 Updated last year
- PyTorch implementation of the InfoNCE loss for self-supervised learning. ☆528 Updated last year
- An official codebase of paper "Revisiting Sparse Convolutional Model for Visual Recognition" ☆124 Updated last year
- Implementation of the paper NAST: Non-Autoregressive Spatial-Temporal Transformer for Time Series Forecasting. ☆76 Updated 3 years ago
- Unofficial Implementation of MLP-Mixer, gMLP, resMLP, Vision Permutator, S2MLP, S2MLPv2, RaftMLP, HireMLP, ConvMLP, AS-MLP, SparseMLP, Co… ☆169 Updated 2 years ago
- [IJCAI-21] "Time-Series Representation Learning via Temporal and Contextual Contrasting" ☆405 Updated 11 months ago
- Code for "CSDI: Conditional Score-based Diffusion Models for Probabilistic Time Series Imputation" ☆352 Updated last year
- iFormer: Inception Transformer ☆244 Updated 2 years ago