thuml / FlowformerLinks
About Code release for "Flowformer: Linearizing Transformers with Conservation Flows" (ICML 2022), https://arxiv.org/pdf/2202.06258.pdf
☆324Updated 11 months ago
Alternatives and similar repositories for Flowformer
Users that are interested in Flowformer are comparing it to the libraries listed below
Sorting:
- [CVPR 2022 Oral] Balanced MSE for Imbalanced Visual Regression https://arxiv.org/abs/2203.16427☆390Updated 2 years ago
- [ICLR 2022] Official implementation of cosformer-attention in cosFormer: Rethinking Softmax in Attention☆193Updated 2 years ago
- The official code for the paper: https://openreview.net/forum?id=_PHymLIxuI☆392Updated last year
- Official PyTorch implementation for the paper "CARD: Classification and Regression Diffusion Models"☆228Updated 2 years ago
- CVPR2022, BatchFormer: Learning to Explore Sample Relationships for Robust Representation Learning, https://arxiv.org/abs/2203.01522☆252Updated 2 years ago
- Source code of ICML'22 paper: FEDformer: Frequency Enhanced Decomposed Transformer for Long-term Series Forecasting☆207Updated 3 years ago
- Source code of NeurIPS'22 paper: FiLM: Frequency improved Legendre Memory Model for Long-term Time Series Forecasting☆33Updated 2 years ago
- Code repository of the paper "Modelling Long Range Dependencies in ND: From Task-Specific to a General Purpose CNN" https://arxiv.org/abs…☆184Updated last month
- Multi-task learning using uncertainty to weigh losses for scene geometry and semantics, Auxiliary Tasks in Multi-task Learning☆624Updated 5 years ago
- Simba☆208Updated last year
- [NeurIPS 2021] [T-PAMI] Global Filter Networks for Image Classification☆479Updated 2 years ago
- Pytorch implementation of the GradNorm. GradNorm addresses the problem of balancing multiple losses for multi-task learning by learning a…☆269Updated 2 years ago
- Implementation of the Transformer variant proposed in "Transformer Quality in Linear Time"☆364Updated last year
- Official implementation of the paper "FilterNet: Harnessing Frequency Filters for Time Series Forecasting"☆190Updated 4 months ago
- Code release for "LogME: Practical Assessment of Pre-trained Models for Transfer Learning" (ICML 2021) and Ranking and Tuning Pre-trained…☆209Updated last year
- The Implementation of "Auto-Lambda: Disentangling Dynamic Task Relationships" [TMLR 2022].☆135Updated 2 years ago
- ☆286Updated 3 years ago
- The official implementation of the paper: "SST: Multi-Scale Hybrid Mamba-Transformer Experts for Long-Short Range Time Series Forecasting…☆162Updated 4 months ago
- Recent Advances in MLP-based Models (MLP is all you need!)☆115Updated 2 years ago
- This is an official implementation of "ModernTCN: A Modern Pure Convolution Structure for General Time Series Analysis" (ICLR 2024 Spotli…☆314Updated last year
- Transformer based on a variant of attention that is linear complexity in respect to sequence length☆777Updated last year
- Self-supervised contrastive learning for time series via time-frequency consistency☆479Updated last year
- ☆190Updated 2 years ago
- TimeMachine: A Time Series is Worth 4 Mambas for Long-term Forecasting☆181Updated 10 months ago
- An All-MLP solution for Vision, from Google AI☆1,025Updated 9 months ago
- This is an official implementation code for paper "A Survey on Time-Series Pre-Trained Models" (TKDE-24).☆180Updated 8 months ago
- A simple cross attention that updates both the source and target in one step☆173Updated last year
- [ICLR 2021 top 3%] Is Attention Better Than Matrix Decomposition?☆333Updated 2 years ago
- RWKV-TS: Beyond Traditional Recurrent Neural Network for Time Series Tasks☆105Updated 10 months ago
- PyTorch implementation of the GradNorm☆96Updated 9 months ago