john-cuffe / MAMBA2
Second Generation of the MAMBA Software (★27)

Related projects:
- A repository for DenseSSMs (★86)
- [CVPR'24] Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities (★85)
- Public code repo for the paper "A Single Transformer for Scalable Vision-Language Modeling" (★103)
- Implementation of ViTAR: Vision Transformer with Any Resolution in PyTorch (★22)
- [NeurIPS 2023] Make Your Pre-trained Model Reversible: From Parameter to Memory Efficient Fine-Tuning (★28)
- [CVPR 2024] The official PyTorch implementation of "A General and Efficient Training for Transformer via Token Expansion" (★36)
- Implementation of the paper "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models" (★56)
- Implementation of "VL-Mamba: Exploring State Space Models for Multimodal Learning" (★75)
- Official PyTorch implementation of "The Hidden Attention of Mamba Models" (★186)
- The official implementation of MTLoRA: A Low-Rank Adaptation Approach for Efficient Multi-Task Learning (★25)
- PyTorch implementation of "From Sparse to Soft Mixtures of Experts" (★38)
- Source code for the EMNLP 2023 main conference paper "Sparse Low-rank Adaptation of Pre-trained Language Models" (★62)
- Awesome works based on SSM and Mamba (★14)
- [ICML 2024] The official implementation of the paper "Rejuvenating image-GPT as Strong Visual Representation Lea…" (★96)
- [ICML 2024] Memory-Space Visual Prompting for Efficient Vision-Language Fine-Tuning (★41)
- Visual self-questioning for large vision-language assistants (★22)
- My implementation of the original Transformer model (Vaswani et al.); also includes a playground.py file for visualizing o… (★36)
- [ICCV 2023] Official code for "VL-PET: Vision-and-Language Parameter-Efficient Tuning via Granularity Control" (★51)
- Repo for the paper "ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models" (★44)
- ✨✨ Beyond LLaVA-HD: Diving into High-Resolution Large Multimodal Models (★128)
- Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning (★93)
- MoVA: Adapting Mixture of Vision Experts to Multimodal Context (★116)
- Introduces Mamba2 to vision (★70)
- Official implementation of the Law of Vision Representation in MLLMs (★93)
- Code for NOLA, an implementation of "NOLA: Compressing LoRA using Linear Combination of Random Basis" (★46)