Modified Mamba code to run on CPU
☆32Jan 14, 2024Updated 2 years ago
Alternatives and similar repositories for mamba-cpu
Users that are interested in mamba-cpu are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Mamba R1 represents a novel architecture that combines the efficiency of Mamba's state space models with the scalability of Mixture of Ex…☆25Oct 13, 2025Updated 6 months ago
- ojjson is a library designed to facilitate JSON interactions with Ollama, a large language api (LLM). It leverages the power of Zod for s…☆12Nov 7, 2024Updated last year
- An efficient pytorch implementation of selective scan in one file, works with both cpu and gpu, with corresponding mathematical derivatio…☆105Oct 14, 2025Updated 6 months ago
- Dynamically controllable Llama-model LLM inference in macOS with MLX☆18Feb 8, 2025Updated last year
- LLM inference in Fortran☆64May 30, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated☆35Aug 14, 2024Updated last year
- Implementation of mamba with rust☆94Mar 9, 2024Updated 2 years ago
- Implementation of MambaByte in "MambaByte: Token-free Selective State Space Model" in Pytorch and Zeta☆126Apr 13, 2026Updated 3 weeks ago
- Hill Space is All You Need☆17Jul 11, 2025Updated 9 months ago
- Some random tools for working with the GGUF file format☆32Nov 24, 2023Updated 2 years ago
- Running LLMs against a sandbox airport to see if they can make the correct decisions in real time☆26Jul 22, 2025Updated 9 months ago
- Simplified example on how to use Vue Design System as an NPM Dependency on Nuxt project☆14Oct 8, 2018Updated 7 years ago
- ☆13Apr 17, 2024Updated 2 years ago
- Machine translation with tinygrad☆19Apr 7, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆11Apr 24, 2024Updated 2 years ago
- NICE: Neurogenesis Inspired Contextual Encoding for Replay-free Class Incremental Learning☆28Jul 28, 2024Updated last year
- Group-relative Trajectory-based Policy Optimization: Increasing Quality and Training Stability☆40Feb 23, 2026Updated 2 months ago
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs.☆41May 24, 2024Updated last year
- [EMNLP2022] Source code for Neural Machine Translation with Contrastive Translation Memories☆12Feb 15, 2023Updated 3 years ago
- ☆17Mar 28, 2025Updated last year
- Implementation of the Mamba SSM with hf_integration.☆55Aug 31, 2024Updated last year
- In-browser 3D compact disc player simulation using THREE.JS (WebGL) and Web Audio API☆12Jul 31, 2020Updated 5 years ago
- Yet another `llama.cpp` Rust wrapper☆12Updated this week
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- rdiv!(::AbstractMatrix, ::UpperTriangular) and ldiv!(::LowerTriangular, ::AbstractMatrix)☆12Nov 18, 2024Updated last year
- A free and open-source GUI tool that simplifies combining multiple code files into one, with automatic labeling and support for various p…☆14Jan 3, 2025Updated last year
- Julia implementation of flash-attention operation for neural networks.☆11May 31, 2023Updated 2 years ago
- A desktop GUI for Flux 1.1 Pro built using DelphiFMX For Python☆11Oct 5, 2024Updated last year
- A repo for finetuning MPT using LoRA. It is currently configured to work with the Alpaca dataset from Stanford but can easily be adapted …☆18Jun 12, 2023Updated 2 years ago
- Fast little priority queue for Rust.☆15Jun 1, 2025Updated 11 months ago
- GFPGAN face reconstruction with ncnn on a bare Raspberry Pi☆14Jan 4, 2023Updated 3 years ago
- Format matrices and tensors to HTML, string, and LaTeX, with Jupyter integration.☆16Nov 18, 2024Updated last year
- The Lily programming language ⚜☆10Apr 7, 2026Updated last month
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Simple agent framework using Ollama tool calling☆10Aug 27, 2024Updated last year
- Converts CLIP models to ONNX☆11Jan 17, 2023Updated 3 years ago
- Sample apps for PUBG developer challenge (Feb 2019)☆13Dec 9, 2022Updated 3 years ago
- PatANN - Pattern-Aware Vector Database and ANN Framework☆20Apr 24, 2025Updated last year
- Automatic differentiation of FEniCS and Firedrake models in Julia☆14Mar 21, 2021Updated 5 years ago
- A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp☆25Updated this week
- SQL Optimizations using MLIR☆12Apr 5, 2020Updated 6 years ago