thuml / Flowformer
Code release for "Flowformer: Linearizing Transformers with Conservation Flows" (ICML 2022), https://arxiv.org/pdf/2202.06258.pdf
☆328 Updated last year
Alternatives and similar repositories for Flowformer
Users who are interested in Flowformer are comparing it to the libraries listed below.
- [CVPR 2022 Oral] Balanced MSE for Imbalanced Visual Regression https://arxiv.org/abs/2203.16427 ☆391 Updated 3 years ago
- Official PyTorch implementation for the paper "CARD: Classification and Regression Diffusion Models" ☆232 Updated 2 years ago
- Source code of ICML'22 paper: FEDformer: Frequency Enhanced Decomposed Transformer for Long-term Series Forecasting ☆216 Updated 3 years ago
- Implementation of the Transformer variant proposed in "Transformer Quality in Linear Time" ☆370 Updated 2 years ago
- [ICLR 2022] Official implementation of cosformer-attention in cosFormer: Rethinking Softmax in Attention ☆196 Updated 2 years ago
- Code repository of the paper "Modelling Long Range Dependencies in ND: From Task-Specific to a General Purpose CNN" https://arxiv.org/abs… ☆183 Updated 6 months ago
- The official code for the paper: https://openreview.net/forum?id=_PHymLIxuI ☆397 Updated last year
- Source code of NeurIPS'22 paper: FiLM: Frequency improved Legendre Memory Model for Long-term Time Series Forecasting ☆33 Updated 3 years ago
- ☆293 Updated 3 years ago
- Simba ☆214 Updated last year
- Official implementation of the paper "FilterNet: Harnessing Frequency Filters for Time Series Forecasting" ☆207 Updated 9 months ago
- A simple cross attention that updates both the source and target in one step ☆185 Updated 3 months ago
- Implementation of the paper NAST: Non-Autoregressive Spatial-Temporal Transformer for Time Series Forecasting. ☆77 Updated 4 years ago
- PyTorch implementation of Representation Learning with Contrastive Predictive Coding by Van den Oord et al. (2018) ☆88 Updated 3 years ago
- This is an official implementation code for paper "A Survey on Time-Series Pre-Trained Models" (TKDE-24). ☆204 Updated last year
- Pytorch implementation of the GradNorm. GradNorm addresses the problem of balancing multiple losses for multi-task learning by learning a… ☆271 Updated 3 years ago
- CVPR2022, BatchFormer: Learning to Explore Sample Relationships for Robust Representation Learning, https://arxiv.org/abs/2203.01522 ☆253 Updated 2 years ago
- Official repository for CVPR21 paper "Deep Stable Learning for Out-Of-Distribution Generalization". ☆198 Updated 3 years ago
- Multi-task learning using uncertainty to weigh losses for scene geometry and semantics, Auxiliary Tasks in Multi-task Learning ☆640 Updated 5 years ago
- The code of the CIKM'25 paper: "SST: Multi-Scale Hybrid Mamba-Transformer Experts for Time Series Forecasting" ☆195 Updated this week
- [NeurIPS 2021] [T-PAMI] Global Filter Networks for Image Classification ☆489 Updated 2 years ago
- An official codebase of paper "Revisiting Sparse Convolutional Model for Visual Recognition" ☆125 Updated 2 years ago
- The official implementation of the CVPR'22 paper SimVP: Simpler Yet Better Video Prediction. ☆271 Updated 2 years ago
- RWKV-TS: Beyond Traditional Recurrent Neural Network for Time Series Tasks ☆118 Updated last year
- ☆248 Updated 3 weeks ago
- Code release for "SimMTM: A Simple Pre-Training Framework for Masked Time-Series Modeling" (NeurIPS 2023 Spotlight), https://arxiv.… ☆142 Updated last year
- PyTorch implementation of the GradNorm ☆111 Updated last year
- Repository for the paper: 'Diffusion-based Time Series Imputation and Forecasting with Structured State Space Models' ☆321 Updated 5 months ago
- Model release for "Sundial: A Family of Highly Capable Time Series Foundation Models" (ICML 2025 Oral) ☆149 Updated 2 months ago
- Multi-head attention in PyTorch ☆154 Updated 6 years ago