tarun7r / Vocal-AgentView external linksLinks
Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities.
☆128Sep 7, 2025Updated 5 months ago
Alternatives and similar repositories for Vocal-Agent
Users that are interested in Vocal-Agent are comparing it to the libraries listed below
Sorting:
- ☆19Jul 4, 2025Updated 7 months ago
- A local-first LLM development studio. Build, test, and customize inference workflows with your own models — no cloud, totally local.☆17May 21, 2025Updated 8 months ago
- Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. F…☆285Apr 14, 2025Updated 9 months ago
- AI Search engine☆13Sep 24, 2025Updated 4 months ago
- 🗣️ Real‑time, low‑latency voice, vision, and conversational‑memory AI assistant built on LiveKit and local LLMs ✨☆107Jun 25, 2025Updated 7 months ago
- MilimoChat: Privacy-first, self-hosted AI chat with customizable personas, context-aware memory, and local analytics. Built on Python/Str…☆14Mar 12, 2025Updated 11 months ago
- Yet another self-hosted AI voice assistant. GlaDOS' blazing fast pipeline with Kokoro TTS voice and vision.☆61Jan 28, 2025Updated last year
- Orpheus Chat WebUI☆76Mar 27, 2025Updated 10 months ago
- ☆50Oct 26, 2025Updated 3 months ago
- ☆17Dec 16, 2024Updated last year
- Chatbot-to-speech using Orpheus TTS model. Interactive console app.☆21May 1, 2025Updated 9 months ago
- An fully autonomous agent that accesses the browser and performs tasks.☆17Apr 25, 2025Updated 9 months ago
- A powerful MCP testing tool with multi-provider LLM support (Ollama, OpenAI, Claude, Gemini). Test, debug, and develop MCP servers with a…☆18Jan 7, 2026Updated last month
- A Field-Theoretic Approach to Unbounded Memory in Large Language Models☆20Apr 15, 2025Updated 9 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆31May 1, 2025Updated 9 months ago
- Super simple python connectors for llama.cpp, including vision models (Gemma 3, Qwen2-VL). Compile llama.cpp and run!☆29Dec 11, 2025Updated 2 months ago
- eleven labs agent☆26Oct 29, 2024Updated last year
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆26Mar 28, 2025Updated 10 months ago
- ACE-Step: A Step Towards Music Generation Foundation Model☆50May 20, 2025Updated 8 months ago
- Locally hosted AI Agent Python Tool To Generate Novel Research Hypothesis + Titles + Abstracts☆30Apr 30, 2025Updated 9 months ago
- ☆15Apr 9, 2025Updated 10 months ago
- Get aid from local LLMs right in your PowerShell☆15May 2, 2025Updated 9 months ago
- ☆12Jan 20, 2026Updated 3 weeks ago
- An AI Vision Language Model System for extracting structured knowledge graph information(JSON) from images of process diagrams☆41Apr 5, 2025Updated 10 months ago
- Random llm scripts☆37Nov 18, 2025Updated 2 months ago
- Create text chunks which end at natural stopping points without using a tokenizer☆26Nov 26, 2025Updated 2 months ago
- ☆201Mar 31, 2025Updated 10 months ago
- Local drive deep search.☆32Jun 4, 2025Updated 8 months ago
- ☆27Jun 11, 2025Updated 8 months ago
- Web application for roleplaying with AI-powered characters☆67Jul 8, 2025Updated 7 months ago
- Scripts and tools for optimizing quantizations in llama.cpp with GGUF imatrices.☆18Jan 10, 2025Updated last year
- Makes llama.cpp easy to use.☆12May 14, 2025Updated 8 months ago
- Open source static analysis toolkit for LLM agent plans☆13Aug 9, 2025Updated 6 months ago
- Yet Another (LLM) Web UI, made with Gemini☆12Dec 25, 2024Updated last year
- Moondream MCP Server in Python☆45Jul 2, 2025Updated 7 months ago
- Service for testing out the new Qwen2.5 omni model☆63Apr 30, 2025Updated 9 months ago
- interactive semantic search demo using Qwen3-0.6B-Embedding in your browser