cactus-compute / cactusLinks
Framework for running AI locally on mobile devices and wearables. Hardware-aware C/C++ backend with wrappers for Flutter & React Native. Kotlin & Swift coming soon.
☆682Updated this week
Alternatives and similar repositories for cactus
Users that are interested in cactus are comparing it to the libraries listed below
Sorting:
- Enable LLMs to Program Themselves.☆609Updated last month
- Run LLMs with MLX☆912Updated this week
- MLX Omni Server is a local inference server powered by Apple's MLX framework, specifically designed for Apple Silicon (M-series) chips. I…☆404Updated this week
- Big & Small LLMs working together☆920Updated this week
- WebAssembly binding for llama.cpp - Enabling on-browser LLM inference☆737Updated last month
- ☆283Updated last month
- An implementation of the CSM(Conversation Speech Model) for Apple Silicon using MLX.☆349Updated 3 weeks ago
- MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.☆1,314Updated this week
- On-device LLM Inference Powered by X-Bit Quantization☆245Updated this week
- FastMLX is a high performance production ready API to host MLX models.☆307Updated 2 months ago
- Dynamiq is an orchestration framework for agentic AI and LLM applications☆867Updated this week
- Real Time Speech Transcription with FastRTC ⚡️and Local Whisper 🤗☆656Updated this week
- Maid is a cross-platform Flutter app for interfacing with GGUF / llama.cpp models locally, and with Ollama and OpenAI models remotely.☆2,011Updated last week
- Apple MLX engine for LM Studio☆583Updated 2 weeks ago
- VS Code extension for LLM-assisted code/text completion☆778Updated last week
- On-device intelligence.☆348Updated 2 months ago
- Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits f…☆1,102Updated last month
- An open-source tool for seamless migration from other LLMs to Llama, and for general prompt optimization.☆360Updated last week
- ☆206Updated 4 months ago
- Gemma 2 optimized for your local machine.☆370Updated 10 months ago
- An implementation of the Nvidia's Parakeet models for Apple Silicon using MLX.☆238Updated last week
- Agent File (.af): An open file format for serializing stateful AI agents with persistent memory and behavior. Share, checkpoint, and vers…☆473Updated 2 weeks ago
- On-device Image Generation for Apple Silicon☆618Updated last month
- MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.☆1,301Updated last month
- Blazing fast whisper turbo for ASR (speech-to-text) tasks☆208Updated 7 months ago
- An extremely fast implementation of whisper optimized for Apple Silicon using MLX.☆712Updated last year
- Model swapping for llama.cpp (or any local OpenAPI compatible server)☆848Updated last week
- Open source multi-modal RAG for building AI apps over private knowledge.☆2,478Updated this week
- On-device Speech Recognition for Android☆92Updated last week
- Reasoning Augmented Generation☆847Updated 3 months ago