buttercutter / Mamba_SSM
A simple implementation of [Mamba: Linear-Time Sequence Modeling with Selective State Spaces](https://arxiv.org/abs/2312.00752)
☆22 · Jan 22, 2024 · Updated 2 years ago
Alternatives and similar repositories for Mamba_SSM
Users who are interested in Mamba_SSM are comparing it to the libraries listed below.
- Implementation of a simple linear regression algorithm in MAMBA ☆10 · Feb 12, 2020 · Updated 6 years ago
- Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Mo… ☆15 · Nov 11, 2024 · Updated last year
- MegEngine implementation of Diffusion Models. ☆18 · Aug 8, 2022 · Updated 3 years ago
- ☆15 · May 8, 2017 · Updated 8 years ago
- Source code for the paper "LongGenBench: Long-context Generation Benchmark" ☆24 · Oct 8, 2024 · Updated last year
- Official implementation code of the paper: "TENT: Tensorized Encoder Transformer for temperature forecasting". ☆20 · Jan 21, 2024 · Updated 2 years ago
- Efficient LSTM parallelization on smartphone GPU ☆21 · Jun 7, 2017 · Updated 8 years ago
- Official implementation of the NeurIPS 2020 "Sparse Weight Activation Training" paper. ☆29 · Jul 23, 2021 · Updated 4 years ago
- A simpler PyTorch + Zeta Implementation of the paper: "SiMBA: Simplified Mamba-based Architecture for Vision and Multivariate Time series… ☆28 · Nov 11, 2024 · Updated last year
- LLM Inference with Microscaling Format ☆34 · Nov 12, 2024 · Updated last year
- A fusion of a linear layer and a cross entropy loss, written for PyTorch in Triton. ☆75 · Aug 2, 2024 · Updated last year
- (ACL 2025 oral) SCOPE: Optimizing KV Cache Compression in Long-context Generation ☆34 · May 28, 2025 · Updated 8 months ago
- Transformers components but in Triton ☆34 · May 9, 2025 · Updated 9 months ago
- PyTorch implementation of Continuously Indexed Flows paper, with many baseline normalising flows ☆31 · Sep 16, 2021 · Updated 4 years ago
- Some preliminary explorations of Mamba's context scaling. ☆218 · Feb 8, 2024 · Updated 2 years ago
- Activation-aware Singular Value Decomposition for Compressing Large Language Models ☆88 · Oct 22, 2024 · Updated last year
- Repo hosting code and materials related to speeding up LLM inference using token merging. ☆37 · Oct 9, 2025 · Updated 4 months ago
- Kinematic and dynamic models of continuum and articulated soft robots. ☆15 · Nov 22, 2025 · Updated 2 months ago
- Implementation of a modular, high-performance, and simplistic mamba for high-speed applications ☆40 · Nov 11, 2024 · Updated last year
- Official code for paper: INT: An Inequality Benchmark for Evaluating Generalization in Theorem Proving ☆39 · Dec 12, 2022 · Updated 3 years ago
- This is the official GDSC repo with all of the source code presented in the video tutorials ☆14 · Jun 27, 2023 · Updated 2 years ago
- Official code for AL-PINNS: Augmented Lagrangian relaxation method for Physics-Informed Neural Networks ☆12 · Jul 29, 2023 · Updated 2 years ago
- HyFormer: Hybrid Transformer and CNN For Pixel-level Multispectral Image Classification ☆15 · Feb 15, 2023 · Updated 2 years ago
- An artificial matrix generator in C ☆12 · Feb 16, 2023 · Updated 2 years ago
- ☆15 · Nov 27, 2025 · Updated 2 months ago
- PyTorch implementation of Structured State Space for Sequence Modeling (S4), based on Annotated S4. ☆89 · Mar 1, 2024 · Updated last year
- BERT Sentiment Classification on the IMDb Large Movie Review Dataset. ☆16 · Sep 8, 2022 · Updated 3 years ago
- Supporting code for "LLMs for your iPhone: Whole-Tensor 4 Bit Quantization" ☆11 · Mar 31, 2024 · Updated last year
- Building or integrating an LLM wrapper shouldn't take more than 10 minutes. ☆12 · Feb 1, 2025 · Updated last year
- Code for the paper "Faster Neural Network Training with Approximate Tensor Operations" ☆10 · Oct 23, 2021 · Updated 4 years ago
- Official PyTorch implementation of Super Vision Transformer (IJCV) ☆43 · Aug 3, 2023 · Updated 2 years ago
- Code for the ICML 2021 paper iDARTS: Differentiable Architecture Search with Stochastic Implicit Gradients ☆10 · May 27, 2021 · Updated 4 years ago
- Namespace encoding hierarchical relationships between proteins, protein families, and protein complexes. ☆12 · Mar 9, 2021 · Updated 4 years ago
- A simple library-less CUDA implementation of the OneSweep sorting algorithm. ☆11 · Feb 26, 2024 · Updated last year
- Fastai+PyTorch implementation of sparse model training methods (SET, SNFS, RigL) + customize-your-own. ☆10 · Oct 20, 2022 · Updated 3 years ago
- Vectorgraph Image Painter ☆12 · Mar 24, 2019 · Updated 6 years ago
- ☆12 · Oct 26, 2022 · Updated 3 years ago
- ☆12 · Jul 9, 2021 · Updated 4 years ago
- POPGym Library in JAX ☆12 · Apr 15, 2024 · Updated last year