cactus-compute / cactusLinks

Framework for running AI locally on mobile devices and wearables. Hardware-aware C/C++ backend with wrappers for Flutter & React Native. Kotlin & Swift coming soon.

☆993

Alternatives and similar repositories for cactus

Users that are interested in cactus are comparing it to the libraries listed below

Sorting:

The-Pocket-World / Pocket-Flow-Framework
Enable LLMs to Program Themselves.
☆623Updated 2 months ago
HazyResearch / minions
Big & Small LLMs working together
☆994Updated this week
google-ai-edge / ai-edge-apis
☆93Updated last week
shubham0204 / SmolChat-Android
Running any GGUF SLMs/LLMs locally, on-device in Android
☆387Updated last week
mybigday / llama.rn
React Native binding of llama.cpp
☆550Updated last week
senstella / csm-mlx
An implementation of the CSM(Conversation Speech Model) for Apple Silicon using MLX.
☆359Updated last month
google-ai-edge / LiteRT-LM
☆234Updated this week
madroidmaq / mlx-omni-server
MLX Omni Server is a local inference server powered by Apple's MLX framework, specifically designed for Apple Silicon (M-series) chips. I…
☆424Updated 2 weeks ago
mozilla-ai / any-agent
A single interface to use and evaluate different agent frameworks
☆499Updated this week
pico-lm / pico-train
A minimalistic framework for transparently training language models and storing comprehensive checkpoints for in-depth learning dynamics …
☆284Updated 2 weeks ago
google-ai-edge / LiteRT
LiteRT continues the legacy of TensorFlow Lite as the trusted, high-performance runtime for on-device AI. Now with LiteRT Next, we're exp…
☆595Updated this week
sofi444 / realtime-transcription-fastrtc
Real Time Speech Transcription with FastRTC ⚡️and Local Whisper 🤗
☆661Updated last week
freddyaboulton / orpheus-cpp
Fast Streaming TTS with Orpheus + WebRTC (with FastRTC)
☆294Updated 2 months ago
letta-ai / agent-file
Agent File (.af): An open file format for serializing stateful AI agents with persistent memory and behavior. Share, checkpoint, and vers…
☆636Updated last month
ml-explore / mlx-lm
Run LLMs with MLX
☆1,125Updated last week
golf-mcp / golf
Production-Ready MCP Server Framework • Build, deploy & scale secure AI agent infrastructure • Includes Auth, Observability, Debugger, Te…
☆628Updated last week
EvanZhouDev / llm.pdf
Run LLMs inside a PDF file.
☆585Updated 2 months ago
morphik-org / morphik-core
Open source multi-modal RAG for building AI apps over private knowledge.
☆2,696Updated this week
unbody-io / unbody
The Supabase of AI era. A modular, open-source backend for building AI-native software — designed for knowledge, not static data.
☆308Updated 3 weeks ago
google-ai-edge / ai-edge-torch
Supporting PyTorch models with the Google AI Edge TFLite runtime.
☆678Updated this week
ngxson / wllama
WebAssembly binding for llama.cpp - Enabling on-browser LLM inference
☆750Updated 3 weeks ago
bytebot-ai / bytebot
Bytebot is the container for desktop agents.
☆727Updated this week
arcee-ai / fastmlx
FastMLX is a high performance production ready API to host MLX models.
☆308Updated 3 months ago
pipecat-ai / smart-turn
☆754Updated 2 months ago
fluxions-ai / vui
☆577Updated this week
cognitivecomputations / dolphin-mcp
☆475Updated last month
chonkie-inc / chonkie
🦛 CHONK your texts with Chonkie ✨ — The no-nonsense RAG chunking library
☆1,538Updated this week
lmstudio-ai / mlx-engine
Apple MLX engine for LM Studio
☆630Updated this week
dynamiq-ai / dynamiq
Dynamiq is an orchestration framework for agentic AI and LLM applications
☆882Updated last week
meta-llama / llama-prompt-ops
An open-source tool for seamless migration from other LLMs to Llama, and for general prompt optimization.
☆493Updated this week