cactus-compute / cactusLinks
Kernels & AI inference engine for phones
☆3,521Updated last week
Alternatives and similar repositories for cactus
Users that are interested in cactus are comparing it to the libraries listed below
Sorting:
- Communicate with an LLM provider using a single interface☆1,017Updated this week
- A high-performance inference engine for AI models☆1,350Updated last week
- Lemonade helps users run local LLMs with the highest performance by configuring state-of-the-art inference engines for their NPUs and GPU…☆1,549Updated this week
- HelixDB is an open-source graph-vector database built from scratch in Rust.☆2,879Updated last week
- Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.☆2,543Updated last month
- Omnara (YC S25) - Talk to Your AI Agents from Anywhere!☆2,452Updated 3 weeks ago
- Big & Small LLMs working together☆1,187Updated this week
- This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025☆6,830Updated 5 months ago
- ☆445Updated last week
- AI agents can now use real Android and iOS apps, just like a human.☆1,782Updated this week
- VS Code extension for LLM-assisted code/text completion☆1,011Updated last week
- The most accurate document search and store for building AI apps☆3,339Updated this week
- MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.☆1,792Updated this week
- LiteRT, successor to TensorFlow Lite. is Google's On-device framework for high-performance ML & GenAI deployment on edge platforms, via e…☆894Updated this week
- A gallery that showcases on-device ML/GenAI use cases and allows people to try and use models locally.☆14,248Updated this week
- An MCP server that autonomously evaluates web applications.☆1,197Updated 3 weeks ago
- Artificial Neural Engine Machine Learning Library☆1,216Updated 2 months ago
- ☆2,041Updated this week
- Open-source LLM load balancer and serving platform for self-hosting LLMs at scale 🏓🦙☆1,345Updated last week
- Run local LLMs like llama, deepseek-distill, kokoro and more inside your browser☆1,262Updated last month
- A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speec…☆2,760Updated last week
- Running any GGUF SLMs/LLMs locally, on-device in Android☆546Updated last month
- On-device TTS model by Neuphonic☆3,759Updated this week
- 🦛 CHONK docs with Chonkie ✨ — The no-nonsense RAG library☆3,051Updated last week
- 🌐 The open-source Agentic browser; privacy-first alternative to ChatGPT Atlas, Perplexity Comet, Dia.☆6,318Updated this week
- Everything about the SmolLM and SmolVLM family of models☆3,346Updated last month
- Run LLMs with MLX☆2,690Updated this week
- Open Source Application for Advanced LLM + Diffusion Engineering: interact, train, fine-tune, and evaluate large language models on your …☆4,475Updated this week
- The data plane for agents. Arch is a models-native proxy server that handles the plumbing work in AI: agent routing & hand off, guardrail…☆4,260Updated this week
- MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.☆2,932Updated 3 months ago