ethanbar11 / ssm_2dLinks
More dimensions = More fun
☆26Updated last year
Alternatives and similar repositories for ssm_2d
Users that are interested in ssm_2d are comparing it to the libraries listed below
Sorting:
- [ICLR2025] This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision☆86Updated 5 months ago
- Code for NOLA, an implementation of "nola: Compressing LoRA using Linear Combination of Random Basis"☆56Updated last year
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆34Updated last year
- ☆48Updated last year
- ☆39Updated last year
- Official PyTorch Implementation of "The Hidden Attention of Mamba Models"☆229Updated last month
- [ICML 2024] This repository includes the official implementation of our paper "Rejuvenating image-GPT as Strong Visual Representation Lea…☆98Updated last year
- Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders☆123Updated 7 months ago
- Official Implementation of DiffCLIP: Differential Attention Meets CLIP☆47Updated 8 months ago
- ☆54Updated 2 years ago
- [NeurIPS '25 Spotlight] Official Pytorch implementation of "Vision Transformers Don't Need Trained Registers"☆142Updated 2 months ago
- [NeurIPS 2024, spotlight] Multivariate Learned Adaptive Noise for Diffusion Models☆30Updated 11 months ago
- Visualizing representations with diffusion based conditional generative model.☆102Updated 2 years ago
- A Triton Kernel for incorporating Bi-Directionality in Mamba2☆75Updated 11 months ago
- The official repo of continuous speculative decoding☆30Updated 7 months ago
- Official implementation of "Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers"☆165Updated 9 months ago
- CatMAE☆14Updated last year
- [CVPR 2024] Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities☆100Updated last year
- [CVPR 2023] Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention During Vision Transformer Inference☆30Updated last year
- State Space Models☆71Updated last year
- Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24]☆60Updated 11 months ago
- The official implementation of "[MASK] is All You Need"☆125Updated 3 months ago
- Collect papers about Mamba (a selective state space model).☆14Updated last year
- ☆57Updated 2 years ago
- Official implementation of RMoE (Layerwise Recurrent Router for Mixture-of-Experts)☆27Updated last year
- The official repository for HyperZ⋅Z⋅W Operator Connects Slow-Fast Networks for Full Context Interaction.☆42Updated 7 months ago
- [CVPR 2024 Highlight] ImageNet-D☆44Updated last year
- [AAAI 2025] Does VLM Classification Benefit from LLM Description Semantics?☆23Updated 3 months ago
- ☆34Updated 6 months ago
- ☆19Updated 10 months ago