yeutterg / speech-to-speech-smart-speakerLinks
Smart speaker (like the Amazon Echo) based on the OpenAI Realtime API
☆20Updated last year
Alternatives and similar repositories for speech-to-speech-smart-speaker
Users that are interested in speech-to-speech-smart-speaker are comparing it to the libraries listed below
Sorting:
- AI Raspberry Pi cat detection and notification: get a text when your cat does something it's not supposed to do, and have AI narrate what…☆193Updated last year
- ☆135Updated 11 months ago
- gpt-oss + voice-ui-kit experiment☆152Updated 5 months ago
- An implementation of the CSM(Conversation Speech Model) for Apple Silicon using MLX.☆393Updated 5 months ago
- Pipecat voice AI agents running locally on macOS☆301Updated 4 months ago
- Chat Application Starter Kit — Gemini Multimodal Live API + Pipecat☆225Updated 3 months ago
- The Moshi speech-to-speech model, deployed to Modal with a realtime CLI chat☆58Updated last year
- ☆143Updated last week
- "Hey Meta send a message to ChatGPT" Mai: A Hacky Messenger browser extension & pseudo API for the Meta Glasses☆657Updated 6 months ago
- Minimal lightweight task orchestrator in Rust☆120Updated last week
- The Official Nimbus SDK☆204Updated this week
- MLX-GUI MLX Inference Server for Apple Silicone☆166Updated last week
- ☆461Updated 2 weeks ago
- Various things I had to figure out recently to make things work better...☆151Updated last week
- Realtime Voice and Vision wtih Brilliant Labs Frame and Gemini☆68Updated 8 months ago
- ☆191Updated last month
- Implementation of the board game Codenames, re-imagined as a collaborative game between LLM agents☆108Updated 10 months ago
- TermNet is an AI-powered terminal assistant that bridges a Large Language Model (LLM) with your local environment. It can safely run shel…☆94Updated 3 months ago
- Phi-3.5 for Mac: Locally-run Vision and Language Models for Apple Silicon☆273Updated 2 months ago
- A powerful Python tool that leverages Claude 3.5 Sonnet Vision API to detect and visualize objects in images. The script automatically dr…☆221Updated last year
- ☆191Updated last year
- Demo showing how to use the OpenAI Realtime API to navigate a 3D scene via tool calling☆469Updated 11 months ago
- Voice AI agent starter kit with Groq, Llama 4, and (optionally) Twilio☆72Updated 4 months ago
- An automated machine learning system that leverages O1 and Claude to iteratively develop, improve, and optimize ML solutions.☆91Updated last year
- Qwen Image models through MPS☆254Updated 3 weeks ago
- BeeMCP: an unofficial Model Context Protocol (MCP) server that connects your Bee wearable lifelogger to AI via the Model Context Protocol☆43Updated 9 months ago
- An AI cursor for desktop using Gemini 2.0 Flash (Experimental)☆337Updated 11 months ago
- ☆68Updated 10 months ago
- Harness the scientific methods of Sydney Brenner using AI Agents☆39Updated 2 weeks ago
- Parseltongue is a powerful prompt hacking tool/browser extension for real-time tokenization visualization and seamless text conversion, s…☆541Updated last year