This is the code that went into our practical dive using mamba as information extraction
☆57Dec 22, 2023Updated 2 years ago
Alternatives and similar repositories for mamba-dive
Users that are interested in mamba-dive are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆20Jun 15, 2023Updated 3 years ago
- ☆31Dec 29, 2023Updated 2 years ago
- ☆19Nov 11, 2023Updated 2 years ago
- Repository for paper Decrypting Cryptic Crosswords☆11Jan 15, 2022Updated 4 years ago
- ☆18Jan 9, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code repository for Black Mamba☆265Feb 8, 2024Updated 2 years ago
- Knowledge graph based information retrieval☆14Dec 26, 2018Updated 7 years ago
- ☆10Dec 4, 2023Updated 2 years ago
- Inference of Mamba, Mamba2 and Mamba3 models in pure C☆202Mar 18, 2026Updated 3 months ago
- Deep learning library implemented from scratch in numpy. Mixtral, Mamba, LLaMA, GPT, ResNet, and other experiments.☆56Apr 12, 2024Updated 2 years ago
- code for "Automated and Intelligent Synthesis of Oxygen-Producing Catalysts from Martian Meteorites by Robotic AI-Chemist "☆12Jul 31, 2023Updated 2 years ago
- Implementation of the Mamba SSM with hf_integration.☆55Aug 31, 2024Updated last year
- ☆18May 18, 2021Updated 5 years ago
- Annotated version of the Mamba paper☆501Feb 27, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆17Oct 27, 2020Updated 5 years ago
- Code for "Self-Lifting: A Novel Framework For Unsupervised Voice-Face Association Learning,ICMR,2022"☆15Oct 25, 2024Updated last year
- LangChain Agent☆11Nov 25, 2025Updated 6 months ago
- Hercules: Attributable and Scalable Opinion Summarization (ACL 2023)☆20Nov 8, 2023Updated 2 years ago
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆62Apr 8, 2024Updated 2 years ago
- Small Multimodal Vision Model "Imp-v1-3b" trained using Phi-2 and Siglip.☆17Feb 5, 2024Updated 2 years ago
- Implementation of MambaByte in "MambaByte: Token-free Selective State Space Model" in Pytorch and Zeta☆127May 11, 2026Updated last month
- [EMNLP 2022] Official implementation of Transnormer in our EMNLP 2022 paper - The Devil in Linear Transformer☆64Jul 30, 2023Updated 2 years ago
- ☆18Dec 8, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Sample use case for Xavier AI in Healthcare conference: https://www.xavierhealth.org/ai-summit-day2/☆27Jun 17, 2024Updated 2 years ago
- Fun project to run your own LLM chat bot using llama.cpp☆11Jun 9, 2023Updated 3 years ago
- "Head-to-Tail How Knowledgeable are Large Language Models (LLMs)? A.K.A. Will LLMs Replace Knowledge Graphs?" (NAACL 2024)☆19Jul 1, 2024Updated last year
- An example FastAPI server that streams messages from Autogen using OpenAI API format☆15Jul 3, 2024Updated last year
- ☆10Apr 25, 2024Updated 2 years ago
- Ongoing research training transformer language models at scale, including: BERT☆16Apr 25, 2019Updated 7 years ago
- Lecture notes and code☆26Feb 6, 2020Updated 6 years ago
- Language Model Accessor Object - Neural autocomplete for Overleaf☆13May 2, 2023Updated 3 years ago
- Cryptic crossword solver☆40Oct 9, 2017Updated 8 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Integrating Mamba/SSMs with Transformer for Enhanced Long Context and High-Quality Sequence Modeling☆224May 11, 2026Updated last month
- Simple, minimal implementation of the Mamba SSM in one file of PyTorch.☆2,956Mar 8, 2024Updated 2 years ago
- ☆12Sep 1, 2023Updated 2 years ago
- ☆14Jul 10, 2021Updated 4 years ago
- [IEEE VL/HCC'25]Frontend Diffusion is an end-to-end LLM-powered tool that generates high-quality websites from user sketches.☆19Oct 10, 2025Updated 8 months ago
- ChatGPT-rs is a lightweight ChatGPT client with a graphical user interface, written in Rust. It allows you to chat with OpenAI's GPT mode…☆13Apr 5, 2023Updated 3 years ago
- Repository for our paper "DeepEdit: Knowledge Editing as Decoding with Constraints". https://arxiv.org/abs/2401.10471☆21Jun 19, 2024Updated 2 years ago