thuml / Flowformer
Code release for "Flowformer: Linearizing Transformers with Conservation Flows" (ICML 2022), https://arxiv.org/pdf/2202.06258.pdf
☆325 Updated last year
Alternatives and similar repositories for Flowformer
Users who are interested in Flowformer are comparing it to the repositories listed below
- [ICLR 2022] Official implementation of cosformer-attention in cosFormer: Rethinking Softmax in Attention ☆196 Updated 2 years ago
- [CVPR 2022 Oral] Balanced MSE for Imbalanced Visual Regression https://arxiv.org/abs/2203.16427 ☆391 Updated 3 years ago
- Source code of ICML'22 paper: FEDformer: Frequency Enhanced Decomposed Transformer for Long-term Series Forecasting ☆212 Updated 3 years ago
- Official PyTorch implementation for the paper "CARD: Classification and Regression Diffusion Models" ☆232 Updated 2 years ago
- The official code for the paper: https://openreview.net/forum?id=_PHymLIxuI ☆396 Updated last year
- Pytorch implementation of the GradNorm. GradNorm addresses the problem of balancing multiple losses for multi-task learning by learning a… ☆271 Updated 3 years ago
- Code repository of the paper "Modelling Long Range Dependencies in ND: From Task-Specific to a General Purpose CNN" https://arxiv.org/abs… ☆183 Updated 5 months ago
- Implementation of the Transformer variant proposed in "Transformer Quality in Linear Time" ☆369 Updated 2 years ago
- PyTorch implementation of the GradNorm ☆110 Updated last year
- Multi-task learning using uncertainty to weigh losses for scene geometry and semantics, Auxiliary Tasks in Multi-task Learning ☆638 Updated 5 years ago
- Implementation of the paper NAST: Non-Autoregressive Spatial-Temporal Transformer for Time Series Forecasting. ☆77 Updated 4 years ago
- CVPR2022, BatchFormer: Learning to Explore Sample Relationships for Robust Representation Learning, https://arxiv.org/abs/2203.01522 ☆252 Updated 2 years ago
- Simba ☆214 Updated last year
- Multi-head attention in PyTorch ☆154 Updated 6 years ago
- [NeurIPS 2021] [T-PAMI] Global Filter Networks for Image Classification ☆489 Updated 2 years ago
- An official codebase of paper "Revisiting Sparse Convolutional Model for Visual Recognition" ☆125 Updated 2 years ago
- An implementation of the efficient attention module. ☆321 Updated 4 years ago
- Source code of NeurIPS'22 paper: FiLM: Frequency improved Legendre Memory Model for Long-term Time Series Forecasting ☆33 Updated 2 years ago
- The official implementation of the CVPR'22 paper SimVP: Simpler Yet Better Video Prediction. ☆269 Updated 2 years ago
- This is an official implementation code for paper "A Survey on Time-Series Pre-Trained Models" (TKDE-24). ☆201 Updated last year
- Official repository for CVPR21 paper "Deep Stable Learning for Out-Of-Distribution Generalization". ☆198 Updated 3 years ago
- A simple cross attention that updates both the source and target in one step ☆182 Updated 3 months ago
- Code release for "SimMTM: A Simple Pre-Training Framework for Masked Time-Series Modeling" (NeurIPS 2023 Spotlight), https://arxiv.… ☆140 Updated last year
- Official implementation of the paper "FilterNet: Harnessing Frequency Filters for Time Series Forecasting" ☆206 Updated 9 months ago
- Self-supervised contrastive learning for time series via time-frequency consistency ☆502 Updated last year
- PyTorch implementation of Representation Learning with Contrastive Predictive Coding by Van den Oord et al. (2018) ☆88 Updated 3 years ago
- The code of the CIKM'25 paper: "SST: Multi-Scale Hybrid Mamba-Transformer Experts for Long-Short Range Time Series Forecasting" ☆183 Updated 2 months ago
- Pytorch implementation for t-SNE with cuda to accelerate ☆336 Updated 2 years ago