Caiyun-AI/DCFormer

☆224

Alternatives and similar repositories for DCFormer

Users who are interested in DCFormer are comparing it to the repositories listed below.

caiyunapp / tower-eye
A visibility estimation project based on tower-top cameras
☆15 · Feb 22, 2024 · Updated 2 years ago
yingtaoluo / PhyDL-NWP
Official Code for a KDD 2025 paper "Physics-Guided Learning of Meteorological Dynamics for Weather Downscaling and Forecasting"
☆20 · Jul 22, 2025 · Updated last year
Qichuzyy / POA
Official implementation of ECCV24 paper: POA
☆24 · Aug 8, 2024 · Updated last year
cnmetlab / pymetaf
A Python package for parsing METAR & TAF raw text
☆11 · Oct 16, 2025 · Updated 9 months ago
Doraemonzzz / nanoTransNormer
☆11 · Oct 11, 2023 · Updated 2 years ago
caiyunapp / cyeva
A general-purpose toolkit for evaluating the accuracy of deterministic forecasts
☆33 · Mar 1, 2026 · Updated 4 months ago
Doraemonzzz / hgru2-pytorch
☆24 · Sep 25, 2024 · Updated last year
glassroom / heinsen_attention
Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)
☆25 · Jun 6, 2024 · Updated 2 years ago
smonsays / hypernetwork-attention
Official code for the paper "Attention as a Hypernetwork"
☆58 · Feb 24, 2026 · Updated 5 months ago
OpenNLPLab / HGRN2
HGRN2: Gated Linear RNNs with State Expansion
☆58 · Aug 20, 2024 · Updated last year
bdusell / stack-attention
Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"
☆18 · Mar 15, 2024 · Updated 2 years ago
haiduo / PartialNet
This repository is the official implementation of "Partial Channel Network: Compute Fewer, Perform Better". [AAAI 2026 Accepted]
☆40 · Feb 11, 2025 · Updated last year
yikangshen / megablocks
☆20 · May 30, 2024 · Updated 2 years ago
OpenNLPLab / TransnormerLLM
Official implementation of TransNormerLLM: A Faster and Better LLM
☆256 · Jan 23, 2024 · Updated 2 years ago
Phylliida / MambaLens
Mamba support for transformer lens
☆20 · Sep 17, 2024 · Updated last year
rafapablos / w4c23-rainai
Weather4Cast 2023 NeurIPS Competition - RainAI
☆16 · Dec 4, 2023 · Updated 2 years ago
berlino / gated_linear_attention
☆107 · Mar 9, 2024 · Updated 2 years ago
dangxingyu / rnn-icrag
Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"
☆27 · Apr 17, 2024 · Updated 2 years ago
caglarkucuk / earthformer-satellite-to-radar
☆19 · Feb 14, 2024 · Updated 2 years ago
qhfan / RALA
[CVPR2025] Breaking the Low-Rank Dilemma of Linear Attention
☆44 · Mar 11, 2025 · Updated last year
krennic999 / STAR
STAR: Scale-wise Text-to-image generation via Auto-Regressive representations
☆150 · Feb 19, 2025 · Updated last year
OpenNLPLab / HGRN
[NeurIPS 2023 spotlight] Official implementation of HGRN in our NeurIPS 2023 paper - Hierarchically Gated Recurrent Neural Network for Se…
☆68 · Apr 24, 2024 · Updated 2 years ago
GeorgeMichailidis / multi-task-mixed-freq
Code repository for "Multi-Task Encoder-Dual-Decoder Modeling Framework on Mixed Frequency Data", International Journal of Forecasting, 2…
☆13 · Feb 18, 2024 · Updated 2 years ago
epfml / DenseFormer
☆83 · Apr 16, 2024 · Updated 2 years ago
OSVAI / KernelWarehouse
The official project website of "KernelWarehouse: Rethinking the Design of Dynamic Convolution" (KW for short, published in ICML 2024)
☆101 · Jun 13, 2024 · Updated 2 years ago
chijames / KERPLE
☆20 · Oct 25, 2022 · Updated 3 years ago
NVIDIA / HMM_sample_code
CUDA 12.2 HMM demos
☆21 · Jul 26, 2024 · Updated 2 years ago
IST-DASLab / QIGen
Repository for CPU Kernel Generation for LLM Inference
☆28 · Jul 13, 2023 · Updated 3 years ago
savinchand / owz_python
☆13 · May 5, 2022 · Updated 4 years ago
LeapLabTHU / Agent-Attention
[ECCV 2024] Official repository of Agent Attention
☆669 · Nov 17, 2024 · Updated last year
google-deepmind / spectral_ssm
☆35 · Apr 12, 2024 · Updated 2 years ago
BryceZhuo / PolyCom
The official implementation of ICLR 2025 paper "Polynomial Composition Activations: Unleashing the Dynamics of Large Language Models".
☆18 · Apr 25, 2025 · Updated last year
tensorgi / TPA
[NeurIPS 2025 Spotlight] TPA: Tensor ProducT ATTenTion Transformer (https://arxiv.org/abs/2501.06425)
☆460 · Jun 15, 2026 · Updated last month
x7zhong / TransfomerDownscaling
This includes the code and data used in the paper "Investigating transformer-based models for downscaling near-surface temperature and wi…
☆28 · May 12, 2023 · Updated 3 years ago
hady1011 / OrthoNets
Orthogonal Channel Attentions Networks
☆53 · Nov 7, 2023 · Updated 2 years ago
kostas1515 / AGLU
[ECCV2024 - Oral] Adaptive Parametric Activation
☆53 · Nov 18, 2025 · Updated 8 months ago
frank-xwang / TBC-TiedBlockConvolution
[AAAI 2021] Pytorch implementation for "Tied Block Convolution: Leaner and Better CNNs with Shared Thinner Filters."
☆40 · May 17, 2021 · Updated 5 years ago
davidleon / science_rcn
Reference implementation of a two-level RCN model
☆11 · Nov 3, 2017 · Updated 8 years ago
Doraemonzzz / xmixers
Xmixers: A collection of SOTA efficient token/channel mixers
☆29 · Sep 4, 2025 · Updated 10 months ago