This is the code that went into our practical dive using mamba as information extraction
☆57Dec 22, 2023Updated 2 years ago
Alternatives and similar repositories for mamba-dive
Users that are interested in mamba-dive are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A simple implementation of [Mamba: Linear-Time Sequence Modeling with Selective State Spaces](https://arxiv.org/abs/2312.00752)☆22Jan 22, 2024Updated 2 years ago
- ☆20Jun 15, 2023Updated 2 years ago
- ☆31Dec 29, 2023Updated 2 years ago
- Mamba-Chat: A chat LLM based on the state-space model architecture 🐍☆940Mar 3, 2024Updated 2 years ago
- ☆19Nov 11, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆18Jan 9, 2024Updated 2 years ago
- Code repository for Black Mamba☆262Feb 8, 2024Updated 2 years ago
- Knowledge graph based information retrieval☆14Dec 26, 2018Updated 7 years ago
- ☆10Dec 4, 2023Updated 2 years ago
- Inference of Mamba, Mamba2 and Mamba3 models in pure C☆199Mar 18, 2026Updated 3 weeks ago
- 🚀 Automatically convert unstructured data into a high-quality 'textbook' format, optimized for fine-tuning Large Language Models (LLMs)☆26Oct 15, 2023Updated 2 years ago
- code for "Automated and Intelligent Synthesis of Oxygen-Producing Catalysts from Martian Meteorites by Robotic AI-Chemist "☆12Jul 31, 2023Updated 2 years ago
- Implementation of the Mamba SSM with hf_integration.☆55Aug 31, 2024Updated last year
- Salient Open Information Extraction☆20Nov 14, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Annotated version of the Mamba paper☆500Feb 27, 2024Updated 2 years ago
- ☆17Oct 27, 2020Updated 5 years ago
- Hercules: Attributable and Scalable Opinion Summarization (ACL 2023)☆20Nov 8, 2023Updated 2 years ago
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆61Apr 8, 2024Updated 2 years ago
- ☆54Nov 22, 2024Updated last year
- This is a guide on how you can implement time series in RNNs using LSTMs to determine the future prices of bitcoin☆17May 27, 2018Updated 7 years ago
- Small Multimodal Vision Model "Imp-v1-3b" trained using Phi-2 and Siglip.☆17Feb 5, 2024Updated 2 years ago
- Implementation of MambaByte in "MambaByte: Token-free Selective State Space Model" in Pytorch and Zeta☆125Updated this week
- [EMNLP 2022] Official implementation of Transnormer in our EMNLP 2022 paper - The Devil in Linear Transformer☆64Jul 30, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Sample use case for Xavier AI in Healthcare conference: https://www.xavierhealth.org/ai-summit-day2/☆27Jun 17, 2024Updated last year
- Fun project to run your own LLM chat bot using llama.cpp☆11Jun 9, 2023Updated 2 years ago
- "Head-to-Tail How Knowledgeable are Large Language Models (LLMs)? A.K.A. Will LLMs Replace Knowledge Graphs?" (NAACL 2024)☆19Jul 1, 2024Updated last year
- An example FastAPI server that streams messages from Autogen using OpenAI API format☆15Jul 3, 2024Updated last year
- Benchmarking Generalization to New Tasks from Natural Language Instructions☆26Jul 2, 2021Updated 4 years ago
- Implementation of a modular, high-performance, and simplistic mamba for high-speed applications☆40Nov 11, 2024Updated last year
- Ongoing research training transformer language models at scale, including: BERT☆16Apr 25, 2019Updated 6 years ago
- Language Model Accessor Object - Neural autocomplete for Overleaf☆13May 2, 2023Updated 2 years ago
- Cryptic crossword solver☆38Oct 9, 2017Updated 8 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆17Feb 28, 2024Updated 2 years ago
- ☆12Sep 1, 2023Updated 2 years ago
- Simple, minimal implementation of the Mamba SSM in one file of PyTorch.☆2,944Mar 8, 2024Updated 2 years ago
- ☆14Jul 10, 2021Updated 4 years ago
- [CoLM 24] Official Repository of MambaByte: Token-free Selective State Space Model☆25Oct 12, 2024Updated last year
- 1st place solution to the Breast Cancer Classification Task of HeLP Challenge 2019.☆14Apr 26, 2020Updated 5 years ago
- Detic + SAM for open-vocabulary object detection and segmentation.☆20Nov 10, 2025Updated 5 months ago