This is the code that went into our practical dive using mamba as information extraction
☆57Dec 22, 2023Updated 2 years ago
Alternatives and similar repositories for mamba-dive
Users that are interested in mamba-dive are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A simple implementation of [Mamba: Linear-Time Sequence Modeling with Selective State Spaces](https://arxiv.org/abs/2312.00752)☆22Jan 22, 2024Updated 2 years ago
- Implementation of the dilated self attention as described in "LongNet: Scaling Transformers to 1,000,000,000 Tokens"☆13Jul 23, 2023Updated 2 years ago
- ☆20Jun 15, 2023Updated 2 years ago
- ☆31Dec 29, 2023Updated 2 years ago
- Mamba-Chat: A chat LLM based on the state-space model architecture 🐍☆941Mar 3, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆19Nov 11, 2023Updated 2 years ago
- Repository for paper Decrypting Cryptic Crosswords☆11Jan 15, 2022Updated 4 years ago
- ☆18Jan 9, 2024Updated 2 years ago
- Code repository for Black Mamba☆265Feb 8, 2024Updated 2 years ago
- Knowledge graph based information retrieval☆14Dec 26, 2018Updated 7 years ago
- Inference of Mamba, Mamba2 and Mamba3 models in pure C☆201Mar 18, 2026Updated 2 months ago
- Deep learning library implemented from scratch in numpy. Mixtral, Mamba, LLaMA, GPT, ResNet, and other experiments.☆56Apr 12, 2024Updated 2 years ago
- 🚀 Automatically convert unstructured data into a high-quality 'textbook' format, optimized for fine-tuning Large Language Models (LLMs)☆26Oct 15, 2023Updated 2 years ago
- code for "Automated and Intelligent Synthesis of Oxygen-Producing Catalysts from Martian Meteorites by Robotic AI-Chemist "☆12Jul 31, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- This repository has implementations of various alternatives to backpropagation for training neural networks.☆25Jan 10, 2025Updated last year
- ☆18Oct 26, 2024Updated last year
- TLLM_QMM strips the implementation of quantized kernels of Nvidia's TensorRT-LLM, removing NVInfer dependency and exposes ease of use Pyt…☆16Jul 5, 2024Updated last year
- Implementation of the Mamba SSM with hf_integration.☆55Aug 31, 2024Updated last year
- ☆18May 18, 2021Updated 5 years ago
- Salient Open Information Extraction☆20Nov 14, 2018Updated 7 years ago
- Annotated version of the Mamba paper☆501Feb 27, 2024Updated 2 years ago
- ☆17Oct 27, 2020Updated 5 years ago
- The Official Repository of the Cryptonite Dataset☆23Feb 19, 2022Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆23May 28, 2025Updated last year
- Daily paper reading records☆15Mar 31, 2025Updated last year
- ☆54Nov 22, 2024Updated last year
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆62Apr 8, 2024Updated 2 years ago
- HealthFC: Verifying Health Claims with Evidence-Based Medical Fact-Checking☆13Apr 11, 2025Updated last year
- Source code for the Paper "Mind the Gap: Benchmarking Spatial Reasoning in Vision-Language Models"☆19Feb 1, 2026Updated 3 months ago
- Small Multimodal Vision Model "Imp-v1-3b" trained using Phi-2 and Siglip.☆17Feb 5, 2024Updated 2 years ago
- PaliGemma Inference and Fine Tuning☆13May 15, 2024Updated 2 years ago
- Implementation of MambaByte in "MambaByte: Token-free Selective State Space Model" in Pytorch and Zeta☆126May 11, 2026Updated 2 weeks ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [EMNLP 2022] Official implementation of Transnormer in our EMNLP 2022 paper - The Devil in Linear Transformer☆64Jul 30, 2023Updated 2 years ago
- ☆18Dec 8, 2024Updated last year
- A minimal WebRTC SFU Implementation☆19Jun 15, 2025Updated 11 months ago
- Fun project to run your own LLM chat bot using llama.cpp☆11Jun 9, 2023Updated 2 years ago
- Collect papers about Mamba (a selective state space model).☆15Aug 6, 2024Updated last year
- Jupyter Notebook running Mamba speech synthesis example on Determined AI. Based on https://2084.substack.com/p/2084-marcrandbot-speech-sy…☆23Feb 8, 2024Updated 2 years ago
- xLSTMAD - Powerful xLSTM based Method for Anomaly Detection☆18Apr 27, 2026Updated last month