This is the code that went into our practical dive using mamba as information extraction
β57Dec 22, 2023Updated 2 years ago
Alternatives and similar repositories for mamba-dive
Users that are interested in mamba-dive are comparing it to the libraries listed below
Sorting:
- A simple implementation of [Mamba: Linear-Time Sequence Modeling with Selective State Spaces](https://arxiv.org/abs/2312.00752)β22Jan 22, 2024Updated 2 years ago
- Mamba-Chat: A chat LLM based on the state-space model architecture πβ942Mar 3, 2024Updated 2 years ago
- Repository for paper Decrypting Cryptic Crosswordsβ10Jan 15, 2022Updated 4 years ago
- Language Model Accessor Object - Neural autocomplete for Overleafβ12May 2, 2023Updated 2 years ago
- Implementation of the Mamba SSM with hf_integration.β55Aug 31, 2024Updated last year
- β19Nov 11, 2023Updated 2 years ago
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIMβ61Apr 8, 2024Updated last year
- Deep learning library implemented from scratch in numpy. Mixtral, Mamba, LLaMA, GPT, ResNet, and other experiments.β54Apr 12, 2024Updated last year
- β13Jul 10, 2021Updated 4 years ago
- Detic + SAM for open-vocabulary object detection and segmentation.β19Nov 10, 2025Updated 3 months ago
- Implementation of a modular, high-performance, and simplistic mamba for high-speed applicationsβ40Nov 11, 2024Updated last year
- Ongoing research training transformer language models at scale, including: BERTβ16Apr 25, 2019Updated 6 years ago
- β18Jan 9, 2024Updated 2 years ago
- β18Oct 26, 2024Updated last year
- Annotated version of the Mamba paperβ497Feb 27, 2024Updated 2 years ago
- This is a guide on how you can implement time series in RNNs using LSTMs to determine the future prices of bitcoinβ17May 27, 2018Updated 7 years ago
- β17May 18, 2021Updated 4 years ago
- This repository has implementations of various alternatives to backpropagation for training neural networks.β22Jan 10, 2025Updated last year
- Integrating Mamba/SSMs with Transformer for Enhanced Long Context and High-Quality Sequence Modelingβ215Jan 30, 2026Updated last month
- β23Nov 7, 2024Updated last year
- Sample use case for Xavier AI in Healthcare conference: https://www.xavierhealth.org/ai-summit-day2/β27Jun 17, 2024Updated last year
- Lecture notes and codeβ26Feb 6, 2020Updated 6 years ago
- Modeling code for a BitNet b1.58 Llama-style model.β25Apr 30, 2024Updated last year
- π Automatically convert unstructured data into a high-quality 'textbook' format, optimized for fine-tuning Large Language Models (LLMs)β25Oct 15, 2023Updated 2 years ago
- Official code for CFNetβ26May 17, 2024Updated last year
- Official code repository to the corresponding paper.β29Sep 14, 2023Updated 2 years ago
- WebAssembly HLS client written in Rustβ28Mar 16, 2018Updated 7 years ago
- A novel implementation of fusing ViT with Mamba into a fast, agile, and high performance Multi-Modal Model. Powered by Zeta, the simplestβ¦β463Feb 13, 2026Updated 3 weeks ago
- Awesome Mamba Papers: A Curated Collection of Research Papers , Tutorials & Blogsβ26Mar 25, 2024Updated last year
- Cryptic crossword solverβ37Oct 9, 2017Updated 8 years ago
- A dataset of cryptic crossword clues, collected from various blogs and digital archives.β33Dec 4, 2022Updated 3 years ago
- Implementation of MambaByte in "MambaByte: Token-free Selective State Space Model" in Pytorch and Zetaβ125Feb 6, 2026Updated last month
- Example for running IREE in a bare-metal Arm environment.β40Feb 24, 2026Updated last week
- Simple, minimal implementation of the Mamba SSM in one file of PyTorch.β2,920Mar 8, 2024Updated last year
- Official PyTorch code for the ICIP 2021 paper 'Syntactically Guided Generative Embeddings For Zero Shot Skeleton Action Recognition'β31Mar 17, 2023Updated 2 years ago
- Extend existing LLMs way beyond the original training length with constant memory usage, without retrainingβ736Apr 10, 2024Updated last year
- utilitiesβ15Jul 2, 2013Updated 12 years ago
- Clean RL implementation using MLXβ35Mar 8, 2024Updated last year
- β12Sep 1, 2023Updated 2 years ago