Implementation of Mamba in Rust
☆92Mar 9, 2024Updated 2 years ago
Alternatives and similar repositories for mamba-ssm
Users that are interested in mamba-ssm are comparing it to the libraries listed below.
- LLM inference in Fortran☆64May 30, 2024Updated last year
- Inference of Mamba, Mamba2 and Mamba3 models in pure C☆199Updated this week
- win32 native frontend for llama-cli☆12Nov 2, 2024Updated last year
- Official Repository for Efficient Linear-Time Attention Transformers.☆18Jun 2, 2024Updated last year
- Modified Mamba code to run on CPU☆30Jan 14, 2024Updated 2 years ago
- My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated☆34Aug 14, 2024Updated last year
- GPT-2 small trained on phi-like data☆68Feb 18, 2024Updated 2 years ago
- Some preliminary explorations of Mamba's context scaling.☆218Feb 8, 2024Updated 2 years ago
- Implementation of the Mamba SSM with hf_integration.☆55Aug 31, 2024Updated last year
- Group-relative Trajectory-based Policy Optimization: Increasing Quality and Training Stability☆39Feb 23, 2026Updated last month
- Yet another frontend for LLM, written using .NET and WinUI 3☆10Sep 14, 2025Updated 6 months ago
- ☆30Feb 27, 2024Updated 2 years ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆68Aug 21, 2024Updated last year
- Copy a bunch of files into your clipboard to provide context for LLMs☆112Feb 8, 2026Updated last month
- Experience the power of AI with this free AI voice generator demo. Utilizing Deepgram and Groq, we transform text into voice seamlessly. …☆37Jun 12, 2024Updated last year
- Code repository for Black Mamba☆263Feb 8, 2024Updated 2 years ago
- It is almost the best 3B model in the current open source industry, surpassing Dolly v2-3b, open lama-3b, and even outperforming the Eleu…☆15Jul 24, 2023Updated 2 years ago
- ☆54Nov 22, 2024Updated last year
- ☆12May 30, 2025Updated 9 months ago
- Rust binding for WFA2-lib☆10Jun 7, 2022Updated 3 years ago
- cortex.llamacpp is a high-efficiency C++ inference engine for edge computing. It is a dynamic library that can be loaded by any server a…☆43Jul 4, 2025Updated 8 months ago
- ☆27Feb 26, 2026Updated 3 weeks ago
- An efficient pytorch implementation of selective scan in one file, works with both cpu and gpu, with corresponding mathematical derivatio…☆104Oct 14, 2025Updated 5 months ago
- Evals meant to evaluate language models' ability to reason over long contexts.☆10Sep 12, 2024Updated last year
- High-Performance Text Deduplication Toolkit☆62Aug 25, 2025Updated 7 months ago
- Inference Llama 2 with a model compiled to native code by TorchInductor☆14Feb 8, 2024Updated 2 years ago
- ☆12Apr 4, 2024Updated last year
- FMS Model Optimizer is a framework for developing reduced precision neural network models.☆21Updated this week
- Run Claude Code on OpenAI models☆20Jul 13, 2025Updated 8 months ago
- A pure Python implementation for TA-LIB based on Cython (Progress: 92/158 Indicators)☆15Jul 27, 2025Updated 7 months ago
- A purer, higher-compression-rate Tokenizer in Rust☆13Dec 21, 2024Updated last year
- BGUI stands for BOOPSI Graphical User Interface. BGUI is free GUI toolkit for the Amiga OS.☆11Apr 29, 2025Updated 10 months ago
- Load and run Llama from safetensors files in C☆15Oct 24, 2024Updated last year
- Seamlessly integrate IoT data with AI agents, enabling the effortless parsing, processing, and utilization of IoT data streams.☆11Jan 27, 2025Updated last year
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆18Oct 13, 2025Updated 5 months ago
- Hardware-accelerated matrix/numeric programming library for Swift☆12Sep 2, 2025Updated 6 months ago
- Inference Llama 2 in one file of pure Haskell (A port of llama2.c from Andrej Karpathy)☆14Oct 17, 2025Updated 5 months ago
- Efficient PScan implementation in PyTorch☆17Jan 2, 2024Updated 2 years ago
- A preprint version of our recent research on the capability of frontier AI systems to do self-replication☆59Dec 18, 2024Updated last year