Lex-au / Vocalis
Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. Features low-latency audio streaming, dynamic visual feedback, and works with local LLM/TTS services via OpenAI-compatible endpoints.
☆75Updated this week
Alternatives and similar repositories for Vocalis:
Users who are interested in Vocalis are comparing it to the repositories listed below.
- ☆68Updated last month
- A cutting-edge cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities.☆57Updated this week
- Orpheus Chat WebUI☆48Updated 2 weeks ago
- A local AI companion that uses a collection of free, open source AI models in order to create two virtual companions that will follow you…☆196Updated this week
- High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs.☆232Updated this week
- Sesame Converse - Real Time Conversations - Powered by Gemma 3☆60Updated 3 weeks ago
- Real-time TTS reading of large text files in your favourite voice. +Translation via LLM (Python script)☆52Updated 5 months ago
- A Conversational Speech Generation Model with Gradio UI and OpenAI compatible API. UI and API support CUDA, MLX and CPU devices.☆165Updated 3 weeks ago
- ☆54Updated this week
- A frontend for creative writing with LLMs☆123Updated 9 months ago
- ☆46Updated last month
- OpenAI compatible TTS for Sesame CSM:1b - Voice Cloning from File/YT☆275Updated 3 weeks ago
- API server for Instant voice cloning by MyShell.☆88Updated 6 months ago
- Since the owner of the repo took it down and it used an MIT license, I guess it's okay to upload it here for people to use.☆34Updated last month
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs☆71Updated 6 months ago
- ☆129Updated last week
- Streaming and Finetuning code for CSM☆97Updated this week
- Run Orpheus 3B Locally With LM Studio☆27Updated 3 weeks ago
- ☆113Updated 2 weeks ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆77Updated 6 months ago
- Your personal and private AI☆45Updated last week
- CaSIL is an advanced natural language processing system that implements a sophisticated four-layer semantic analysis architecture. It pro…☆64Updated 5 months ago
- Open source tool for transcription and subtitling, alternative to happyscribe.☆26Updated 2 months ago
- This extension enhances the capabilities of textgen-webui by integrating advanced vision models, allowing users to have contextualized co…☆53Updated 5 months ago
- Examples of using the llasa-tts models locally☆160Updated 2 months ago
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆22Updated 2 weeks ago
- Open source LLM UI, compatible with all local LLM providers.☆173Updated 6 months ago
- Yet another self-hosted AI voice assistant. GlaDOS' blazing fast pipeline with a more realistic Kokoro TTS voice and vision.☆54Updated 2 months ago
- An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3...no…☆117Updated 5 months ago
- ☆46Updated last month