Dao-AILab/causal-conv1d

Causal depthwise conv1d in CUDA, with a PyTorch interface

☆744

Alternatives and similar repositories for causal-conv1d

Users who are interested in causal-conv1d compare it to the libraries listed below.

state-spaces / mamba
Mamba SSM architecture
☆17,311 · Feb 18, 2026 · Updated 2 weeks ago
MzeroMiko / VMamba
VMamba: Visual State Space Models, code is based on mamba
☆3,054 · Mar 7, 2025 · Updated last year
hustvl / Vim
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
☆3,815 · Feb 13, 2025 · Updated last year
fla-org / flash-linear-attention
🚀 Efficient implementations of state-of-the-art linear attention models
☆4,474 · Updated this week
alxndrTL / mamba.py
A simple and efficient Mamba implementation in pure PyTorch and MLX.
☆1,434 · Jan 26, 2026 · Updated last month
JCruan519 / VM-UNet
(ACM TOMM) This is the official code repository for "VM-UNet: Vision Mamba UNet for Medical Image Segmentation".
☆800 · Sep 3, 2025 · Updated 6 months ago
bowang-lab / U-Mamba
U-Mamba: Enhancing Long-range Dependency for Biomedical Image Segmentation
☆961 · Apr 4, 2024 · Updated last year
HazyResearch / flash-fft-conv
FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores
☆343 · Dec 28, 2024 · Updated last year
johnma2006 / mamba-minimal
Simple, minimal implementation of the Mamba SSM in one file of PyTorch.
☆2,920 · Mar 8, 2024 · Updated 2 years ago
yyyujintang / Awesome-Mamba-Papers
Awesome Papers related to Mamba.
☆1,389 · Oct 17, 2024 · Updated last year
Dao-AILab / fast-hadamard-transform
Fast Hadamard transform in CUDA, with a PyTorch interface
☆285 · Oct 19, 2025 · Updated 4 months ago
jzhang38 / LongMamba
Some preliminary explorations of Mamba's context scaling.
☆218 · Feb 8, 2024 · Updated 2 years ago
proger / nanokitchen
Parallel Associative Scan for Language Models
☆18 · Jan 8, 2024 · Updated 2 years ago
Hprairie / Bi-Mamba2
A Triton Kernel for incorporating Bi-Directionality in Mamba2
☆78 · Dec 18, 2024 · Updated last year
NVlabs / MambaVision
[CVPR 2025] Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone
☆2,039 · Feb 9, 2026 · Updated last month
bdusell / stack-attention
Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"
☆18 · Mar 15, 2024 · Updated last year
state-spaces / s4
Structured state space sequence models
☆2,854 · Jul 17, 2024 · Updated last year
HazyResearch / based
Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff"
☆249 · Jun 6, 2025 · Updated 9 months ago
maximzubkov / fft-scan
Efficient PScan implementation in PyTorch
☆17 · Jan 2, 2024 · Updated 2 years ago
lxxue / prefix_sum
A PyTorch wrapper of parallel exclusive scan in CUDA
☆12 · May 25, 2023 · Updated 2 years ago
srush / annotated-mamba
Annotated version of the Mamba paper
☆497 · Feb 27, 2024 · Updated 2 years ago
Dao-AILab / flash-attention
Fast and memory-efficient exact attention
☆22,460 · Updated this week
TiledTensor / TiledBench
Benchmark tests supporting the TiledCUDA library.
☆18 · Nov 19, 2024 · Updated last year
pytorch / ao
PyTorch native quantization and sparsity for training and inference
☆2,722 · Updated this week
meta-pytorch / attention-gym
Helpful tools and examples for working with flex-attention
☆1,140 · Feb 8, 2026 · Updated last month
tridao / flash-attention-wheels
☆61 · Nov 27, 2023 · Updated 2 years ago
GindaChen / FlexFlashAttention3
FlexAttention w/ FlashAttention3 Support
☆27 · Oct 5, 2024 · Updated last year
NVlabs / GatedDeltaNet
[ICLR 2025] Official PyTorch Implementation of Gated Delta Networks: Improving Mamba2 with Delta Rule
☆490 · Feb 17, 2026 · Updated 2 weeks ago
AmeenAli / HiddenMambaAttn
Official PyTorch Implementation of "The Hidden Attention of Mamba Models"
☆231 · Oct 16, 2025 · Updated 4 months ago
sjelassi / transformers_ssm_copy
☆36 · Feb 26, 2024 · Updated 2 years ago
HazyResearch / zoology
Understand and test language model architectures on synthetic tasks.
☆257 · Feb 24, 2026 · Updated 2 weeks ago
hunto / LocalMamba
Code for paper LocalMamba: Visual State Space Model with Windowed Selective Scan
☆275 · May 6, 2024 · Updated last year
VISION-SJTU / QuadMamba
Official code for [NeurIPS 2024] QuadMamba: Learning Quadtree-based Selective Scan for Visual State Space Model
☆46 · Aug 4, 2025 · Updated 7 months ago
Ruixxxx / Awesome-Vision-Mamba-Models
[Official Repo] Visual Mamba: A Survey and New Outlooks
☆733 · Feb 18, 2025 · Updated last year
HazyResearch / ThunderKittens
Tile primitives for speedy kernels
☆3,202 · Feb 24, 2026 · Updated last week
YuHengsss / VSSD
[ICCV2025] Introduce Mamba2 to Vision.
☆186 · Oct 29, 2025 · Updated 4 months ago
goombalab / hydra
Official implementation of "Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers"
☆170 · Jan 30, 2025 · Updated last year
WailordHe / DenseSSM
A repository for DenseSSMs
☆89 · Apr 11, 2024 · Updated last year
nobodyplayer1 / VM-UNetV2
☆119 · Mar 15, 2024 · Updated last year