Pipecat voice AI agents running locally on macOS
☆322 Aug 26, 2025 Updated 9 months ago
Alternatives and similar repositories for macos-local-voice-agents
Users that are interested in macos-local-voice-agents are comparing it to the libraries listed below.
- ☆82 Mar 19, 2026 Updated 2 months ago
- ☆20 Oct 25, 2025 Updated 7 months ago
- ☆21 Jan 15, 2026 Updated 4 months ago
- Examples using MLX Swift ☆13 Apr 9, 2025 Updated last year
- FastMLX is a high performance production ready API to host MLX models. ☆358 Mar 18, 2025 Updated last year
- Craft and run Agents right from your phone ☆32 Oct 14, 2025 Updated 7 months ago
- "Hey, Computer" from Star Trek. Talk to your agent. Run hooks after trigger commands. Runs locally, cause shit's scary. ☆149 May 10, 2026 Updated last month
- ☆40 Apr 15, 2026 Updated last month
- Fast parallel LLM inference for MLX ☆249 Jul 7, 2024 Updated last year
- This repo maintains a 'cheat sheet' for LLMs that are undertrained on mlx ☆33 Mar 12, 2026 Updated 2 months ago
- Metadspy: The framework for specifying—not programming—language models ☆88 Jun 18, 2025 Updated 11 months ago
- ☆26 Apr 10, 2026 Updated 2 months ago
- llms can learn their own context compression via RL ☆43 Nov 26, 2025 Updated 6 months ago
- A mono-repo to house the various supported Transport options to be used with Pipecat's client-js package ☆30 Updated this week
- experiments with MLX ☆68 Dec 15, 2025 Updated 5 months ago
- Cog wrapper for canopylabs/orpheus-3b-0.1-ft ☆22 Mar 20, 2025 Updated last year
- The easiest way to run the fastest MLX-based LLMs locally ☆327 Oct 30, 2024 Updated last year
- Very basic framework for composable parameterized large language model (Q)LoRA / (Q)Dora fine-tuning using mlx, mlx_lm, and OgbujiPT. ☆42 Jun 20, 2025 Updated 11 months ago
- Integrate Talon voice dictation commands with TTS, screen readers, braille, and more! ☆23 May 25, 2026 Updated 2 weeks ago
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) with… ☆24 Jan 7, 2024 Updated 2 years ago
- ☆41 Aug 21, 2025 Updated 9 months ago
- GenAI & agent toolkit for Apple Silicon Mac, implementing JSON schema-steered structured output (3SO) and tool-calling in Python. For mor… ☆135 Feb 27, 2026 Updated 3 months ago
- MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX. ☆4,920 Jun 4, 2026 Updated last week
- cog implementation of All-In-One Music Structure Analyzer ☆64 Nov 5, 2024 Updated last year
- Safely push a Cog model version by making sure it works and is backwards-compatible with previous versions. ☆16 Dec 4, 2025 Updated 6 months ago
- NeurIPS 2026 paper: The Geometry of Consolidation — follow-up to HIDE and No-Escape. ☆110 May 5, 2026 Updated last month
- SmolVLM2 Demo ☆188 Mar 20, 2025 Updated last year
- Examples for using the SiLLM framework for training and running Large Language Models (LLMs) on Apple Silicon ☆16 May 8, 2025 Updated last year
- generalized rust interface for subnets. ☆21 Nov 21, 2024 Updated last year
- Real-time voice assistant — WebRTC streaming, faster-whisper ASR, local LLM, Vui Nano (300M) TTS. OpenAI Realtime API compatible. Voice c… ☆695 Jun 3, 2026 Updated last week
- Train Large Language Models on MLX. ☆374 May 8, 2026 Updated last month
- cheap & easy LLM experiments for amateurs (alpha) ☆25 Nov 30, 2025 Updated 6 months ago
- zer0dex is a local dual-layer memory pattern for AI agents: a compressed, human-readable markdown index plus a vector store queried autom… ☆53 Updated this week
- Tensor-Slayer : Manipulate weights and tensors of LLMs to achieve performance upgrades and introduce a novel inferenceless mechanistic in… ☆28 May 27, 2025 Updated last year
- ☆107 Nov 1, 2025 Updated 7 months ago
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX ☆24 Oct 30, 2024 Updated last year
- MLX Omni Server is a local inference server powered by Apple's MLX framework, specifically designed for Apple Silicon (M-series) chips. I… ☆721 May 9, 2026 Updated last month
- Video production for developers ☆44 May 1, 2026 Updated last month
- ☆22 Aug 31, 2025 Updated 9 months ago