Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities.
☆130Sep 7, 2025Updated 6 months ago
Alternatives and similar repositories for Vocal-Agent
Users that are interested in Vocal-Agent are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Whisper STT + Orpheus TTS + Gemma 3 using LM Studio to create a virtual assistant.☆85Dec 22, 2025Updated 3 months ago
- ☆50Oct 26, 2025Updated 5 months ago
- A real-time speech-to-speech chatbot powered by Whisper Small, Llama 3.2, and Kokoro-82M.☆250Jan 20, 2025Updated last year
- Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. F…☆293Apr 14, 2025Updated 11 months ago
- ☆19Jul 4, 2025Updated 8 months ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- ☆15Feb 1, 2025Updated last year
- Yet another self-hosted AI voice assistant. GlaDOS' blazing fast pipeline with Kokoro TTS voice and vision.☆63Jan 28, 2025Updated last year
- 🗣️ Real‑time, low‑latency voice, vision, and conversational‑memory AI assistant built on LiveKit and local LLMs ✨☆109Jun 25, 2025Updated 9 months ago
- A local-first LLM development studio. Build, test, and customize inference workflows with your own models — no cloud, totally local.☆17May 21, 2025Updated 10 months ago
- MV-RAG combines retrieval with multi-view generation to create accurate 3D-consistent visuals. By retrieving reference images and text, i…☆24Nov 29, 2025Updated 3 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆31May 1, 2025Updated 10 months ago
- AI Search engine☆13Sep 24, 2025Updated 6 months ago
- Orpheus Chat WebUI☆75Mar 27, 2025Updated last year
- ☆17Dec 16, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Real-time voice conversation system with Sesame CSM, featuring web-based audio visualization and GPU acceleration. Educational implementa…☆18Mar 18, 2025Updated last year
- MilimoChat: Privacy-first, self-hosted AI chat with customizable personas, context-aware memory, and local analytics. Built on Python/Str…☆14Mar 12, 2025Updated last year
- Chatbot-to-speech using Orpheus TTS model. Interactive console app.☆21May 1, 2025Updated 10 months ago
- Random llm scripts☆37Mar 18, 2026Updated last week
- Service for testing out the new Qwen2.5 omni model☆63Apr 30, 2025Updated 10 months ago
- Moondream MCP Server in Python☆44Jul 2, 2025Updated 8 months ago
- eleven labs agent☆26Oct 29, 2024Updated last year
- An AI memory layer with short- and long-term storage, semantic clustering, and optional memory decay for context-aware applications.☆684Mar 18, 2026Updated last week
- Simple Tool Caller for llama.cpp☆11Aug 12, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆26Mar 28, 2025Updated 11 months ago
- Locally hosted AI Agent Python Tool To Generate Novel Research Hypothesis + Titles + Abstracts☆30Apr 30, 2025Updated 10 months ago
- An fully autonomous agent that accesses the browser and performs tasks.☆18Apr 25, 2025Updated 11 months ago
- Super simple python connectors for llama.cpp, including vision models (Gemma 3, Qwen2-VL). Compile llama.cpp and run!☆29Dec 11, 2025Updated 3 months ago
- Bookmarklet to pull and run hugging face GGUF models in Ollama☆18Oct 17, 2024Updated last year
- interactive semantic search demo using Qwen3-0.6B-Embedding in your browser☆57Feb 25, 2026Updated last month
- ☆202Mar 31, 2025Updated 11 months ago
- Implementation of Sesame's Conversational Speech Model for Hugging Face Transformers☆57May 17, 2025Updated 10 months ago
- Create text chunks which end at natural stopping points without using a tokenizer☆26Nov 26, 2025Updated 4 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- A Field-Theoretic Approach to Unbounded Memory in Large Language Models☆20Apr 15, 2025Updated 11 months ago
- Controllable Language Model Interactions in TypeScript☆10May 17, 2024Updated last year
- A simple CLI app which allows you to generate and deploy simple apps. MVP.☆21Aug 4, 2025Updated 7 months ago
- A React-based web application that allows users to share their screen and audio with an AI assistant. The assistant provides real-time tr…☆22Sep 22, 2025Updated 6 months ago
- Allows two LLMs to communicate and run code in the terminal☆28Dec 8, 2024Updated last year
- ☆27Jun 11, 2025Updated 9 months ago
- realtime conversational dynamics☆19Mar 19, 2025Updated last year