thuml / Flowformer
Code release for "Flowformer: Linearizing Transformers with Conservation Flows" (ICML 2022), https://arxiv.org/pdf/2202.06258.pdf
☆311Updated 7 months ago
Alternatives and similar repositories for Flowformer:
Users who are interested in Flowformer are comparing it to the libraries listed below:
- [ICLR 2022] Official implementation of cosformer-attention in cosFormer: Rethinking Softmax in Attention☆185Updated 2 years ago
- [CVPR 2022 Oral] Balanced MSE for Imbalanced Visual Regression https://arxiv.org/abs/2203.16427☆378Updated 2 years ago
- Source code of ICML'22 paper: FEDformer: Frequency Enhanced Decomposed Transformer for Long-term Series Forecasting☆204Updated 2 years ago
- Code repository of the paper "Modelling Long Range Dependencies in ND: From Task-Specific to a General Purpose CNN" https://arxiv.org/abs…☆183Updated last year
- The official code for the paper: https://openreview.net/forum?id=_PHymLIxuI☆376Updated last year
- ☆276Updated 2 years ago
- CVPR2022, BatchFormer: Learning to Explore Sample Relationships for Robust Representation Learning, https://arxiv.org/abs/2203.01522☆248Updated last year
- Official implementation of the paper "FilterNet: Harnessing Frequency Filters for Time Series Forecasting"☆161Updated 3 weeks ago
- Multi-task learning using uncertainty to weigh losses for scene geometry and semantics, Auxiliary Tasks in Multi-task Learning☆599Updated 4 years ago
- Transformer based on a variant of attention that has linear complexity with respect to sequence length☆738Updated 9 months ago
- Implementation of the Transformer variant proposed in "Transformer Quality in Linear Time"☆358Updated last year
- Official PyTorch implementation for the paper "CARD: Classification and Regression Diffusion Models"☆221Updated last year
- Code release for "SimMTM: A Simple Pre-Training Framework for Masked Time-Series Modeling" (NeurIPS 2023 Spotlight), https://arxiv.…☆120Updated 9 months ago
- This is an official implementation code for paper "A Survey on Time-Series Pre-Trained Models" (TKDE-24).☆168Updated 4 months ago
- The official implementation of the paper: "SST: Multi-Scale Hybrid Mamba-Transformer Experts for Long-Short Range Time Series Forecasting…☆146Updated 3 months ago
- [NeurIPS 2021] [T-PAMI] Global Filter Networks for Image Classification☆465Updated last year
- Self-supervised contrastive learning for time series via time-frequency consistency☆460Updated 9 months ago
- The Implementation of "Auto-Lambda: Disentangling Dynamic Task Relationships" [TMLR 2022].☆132Updated 2 years ago
- Official repository for CVPR21 paper "Deep Stable Learning for Out-Of-Distribution Generalization".☆187Updated 2 years ago
- Pytorch implementation of the GradNorm. GradNorm addresses the problem of balancing multiple losses for multi-task learning by learning a…☆260Updated 2 years ago
- Official implementation of SAMformer, a transformer leveraging Sharpness-Aware Minimization and Channel-Wise Attention for Time Series Fo…☆148Updated 2 months ago
- Official implementation of "Multi-Task Learning as a Bargaining Game" [ICML 2022]☆216Updated 9 months ago
- An up-to-date list of works on Multi-Task Learning☆330Updated 3 months ago
- FITS: Frequency Interpolation Time Series Analysis Baseline☆155Updated 2 months ago
- PyTorch code for CoST: Contrastive Learning of Disentangled Seasonal-Trend Representations for Time Series Forecasting (ICLR 2022)☆221Updated last year
- This is an official implementation of "ModernTCN: A Modern Pure Convolution Structure for General Time Series Analysis" (ICLR 2024 Spotli…☆251Updated 10 months ago
- Code release of "Interpretable Weather Forecasting for Worldwide Stations with a Unified Deep Model", Nature Machine Intelligence, …☆172Updated last month
- An official codebase of paper "Revisiting Sparse Convolutional Model for Visual Recognition"☆124Updated last year
- Simba☆201Updated 10 months ago
- ☆89Updated 3 months ago