Trying out the Mamba architecture on small examples (cifar-10, shakespeare char level etc.)
☆47Dec 12, 2023Updated 2 years ago
Alternatives and similar repositories for mamba_small_bench
Users that are interested in mamba_small_bench are comparing it to the libraries listed below
Sorting:
- ViT architecture with Mamba instead of transformer backbone☆18Dec 8, 2023Updated 2 years ago
- ☆11Jul 20, 2023Updated 2 years ago
- ☆12Sep 1, 2023Updated 2 years ago
- Sui Version Manager☆19Jan 13, 2024Updated 2 years ago
- Source code for the paper "LongGenBench: Long-context Generation Benchmark"☆23Oct 8, 2024Updated last year
- [ICLR 2024 Spotlight] "Negative Label Guided OOD Detection with Pretrained Vision-Language Models"☆21Oct 23, 2024Updated last year
- Cross Atlas Remapping via Optimal Transport☆12Dec 14, 2023Updated 2 years ago
- ☆24Jan 21, 2024Updated 2 years ago
- ☆28Jun 9, 2024Updated last year
- A simpler Pytorch + Zeta Implementation of the paper: "SiMBA: Simplified Mamba-based Architecture for Vision and Multivariate Time series…☆28Nov 11, 2024Updated last year
- DeciMamba: Exploring the Length Extrapolation Potential of Mamba (ICLR 2025)☆32Apr 9, 2025Updated 10 months ago
- This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompt…☆30Oct 21, 2024Updated last year
- This repository is the implementation of the paper Training Free Pretrained Model Merging (CVPR2024).☆33Mar 5, 2024Updated 2 years ago
- A MATLAB function library containing encoders, decoders and weight enumerators for Reed-Muller codes.☆11Aug 19, 2023Updated 2 years ago
- Simple, minimal implementation of the Mamba SSM in one file of PyTorch.☆2,920Mar 8, 2024Updated last year
- The official implementation of "Unlocking the Potential of Unlabeled Data in Semi-Supervised Domain Generalization" (CVPR 2025)☆14Nov 20, 2025Updated 3 months ago
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆37Oct 9, 2025Updated 4 months ago
- Implementation of vision transformer. ⭐⭐⭐☆33Oct 26, 2021Updated 4 years ago
- A cross-platform Sui SDK for Mobile, Web and Desktop☆37Nov 12, 2024Updated last year
- ☆15Mar 15, 2022Updated 3 years ago
- ☆11Nov 20, 2024Updated last year
- Supporting code for "LLMs for your iPhone: Whole-Tensor 4 Bit Quantization"☆11Mar 31, 2024Updated last year
- The code will come soon.☆15Sep 12, 2025Updated 5 months ago
- An analog, transistor-level simulation of an 8-bit CPU in SPICE☆13Jul 29, 2021Updated 4 years ago
- ☆12Feb 16, 2024Updated 2 years ago
- LFSMIM: A Low-Frequency Spectral Masked Image Modeling Method for Hyperspectral Image Classification☆12Mar 7, 2024Updated last year
- ☆46Nov 2, 2023Updated 2 years ago
- ☆43Oct 31, 2024Updated last year
- ☆12Apr 16, 2024Updated last year
- ☆11Dec 13, 2023Updated 2 years ago
- Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Mo…☆15Nov 11, 2024Updated last year
- Self-Teaching Notes on Gradient Leakage Attacks against GPT-2 models.☆14Mar 18, 2024Updated last year
- sEEG software☆17Feb 11, 2026Updated 3 weeks ago
- Code for "DeepPolar codes", ICML 2024☆12May 7, 2024Updated last year
- Matlab codes for Unsourced Multiple Access With Random User Activity by K.-H. Ngo, A. Lancho, G. Durisi, and A. Graell i Amat☆12May 26, 2023Updated 2 years ago
- ☆16Jun 14, 2024Updated last year
- Official implement of ACL'25 Findings paper "MMUnlearner: Reformulating Multimodal Machine Unlearning in the Era of Multimodal Large Lang…☆19Jun 17, 2025Updated 8 months ago
- This code is for project: [Exploiting Temporal Side Information in Massive IoT Connectivity] and [On Massive IoT Connectivity with Tempor…☆11Mar 18, 2024Updated last year
- Nano vLLM☆12Jun 26, 2025Updated 8 months ago