Lex-au/Vocalis

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Lex-au/Vocalis)

Lex-au / Vocalis

Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. Features low-latency audio streaming, dynamic visual feedback, and works with local LLM/TTS services via OpenAI-compatible endpoints.

☆309

Alternatives and similar repositories for Vocalis

Users that are interested in Vocalis are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

taresh18 / conversify
View on GitHub
🗣️ Real‑time, low‑latency voice, vision, and conversational‑memory AI assistant built on LiveKit and local LLMs
☆111Jun 25, 2025Updated last year
Lex-au / Orpheus-FastAPI
View on GitHub
High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs.
☆716Jul 5, 2025Updated last year
ReisCook / VoiceAssistant
View on GitHub
A functioning Sesame CSM project with a desktop GUI - Real-time factor: 0.6x with 4070 Ti Super - Requires only 8GB VRAM
☆81May 19, 2025Updated last year
tarun7r / Vocal-Agent
View on GitHub
Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities.
☆138Sep 7, 2025Updated 10 months ago
ExoFi-Labs / OllamaGTTS
View on GitHub
☆202Mar 31, 2025Updated last year
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
davidbrowne17 / csm-streaming
View on GitHub
Realtime demo, Streaming and Finetuning code for CSM
☆455Sep 17, 2025Updated 10 months ago
PasiKoodaa / dia
View on GitHub
A TTS model capable of generating ultra-realistic dialogue in one pass.
☆32May 1, 2025Updated last year
isaiahbjork / orpheus-tts-local
View on GitHub
Run Orpheus 3B Locally With LM Studio
☆546Mar 20, 2025Updated last year
QuwsarOhi / NanoAgent
View on GitHub
An agent that can run everywhere - even in your watch!
☆34Apr 8, 2026Updated 3 months ago
fidecastro / llama-cpp-connector
View on GitHub
Super simple python connectors for llama.cpp, including vision models (Gemma 3, Qwen2-VL). Compile llama.cpp and run!
☆31Dec 11, 2025Updated 7 months ago
fajrmn / kokoro-on-browser
View on GitHub
☆16Feb 1, 2025Updated last year
akashjss / sesame-csm
View on GitHub
A Conversational Speech Generation Model with Gradio UI and OpenAI compatible API. UI and API support CUDA, MLX and CPU devices.
☆214May 9, 2025Updated last year
rhulha / Speech2Speech
View on GitHub
A web application that converts speech to speech 100% private
☆86Jun 3, 2025Updated last year
KoljaB / RealtimeVoiceChat
View on GitHub
Have a natural, spoken conversation with AI!
☆3,804Jul 11, 2025Updated last year
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
PkmX / orpheus-chat-webui
View on GitHub
Orpheus Chat WebUI
☆76Mar 27, 2025Updated last year
houtianze / audiobook-generator
View on GitHub
☆15Mar 18, 2026Updated 4 months ago
KartDriver / mira_converse
View on GitHub
☆83Feb 28, 2025Updated last year
ColeMurray / moondream-mcp
View on GitHub
Moondream MCP Server in Python
☆49Jul 2, 2025Updated last year
RhinoDevel / mt_llm
View on GitHub
Pure C wrapper library to use llama.cpp with Linux and Windows as simple as possible.
☆15Updated this week
iluxu / llmbasedos
View on GitHub
llmbasedos — Local-First OS Where Your AI Agents Wake Up and Work
☆289Jan 6, 2026Updated 6 months ago
julianthomas04 / Nova2
View on GitHub
An AI assistant building SDK in python
☆43Sep 21, 2025Updated 10 months ago
phildougherty / sesame_csm_openai
View on GitHub
OpenAI compatible TTS for Sesame CSM:1b & dia:1.6b - Voice Cloning from File/YT
☆437Sep 26, 2025Updated 9 months ago
kyutai-labs / unmute
View on GitHub
Make text LLMs listen and speak
☆1,369Jul 16, 2026Updated last week
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
elevenyellow / handcrafted-persona-engine
View on GitHub
An AI-powered interactive avatar engine using Live2D, LLM, ASR, TTS, and RVC. Ideal for VTubing, streaming, and virtual assistant applica…
☆1,331May 20, 2026Updated 2 months ago
KevinAHM / echo-tts-api
View on GitHub
Echo-TTS OpenAI Compatible Speech Endpoint w/ Streaming
☆29Apr 5, 2026Updated 3 months ago
FarFetchd / sleepyllama
View on GitHub
an auto-sleeping and -waking framework around llama.cpp
☆13Feb 8, 2025Updated last year
christopherthompson81 / vernacula
View on GitHub
ONNX speech pipeline library for ASR, diarization, VAD, and denoising
☆19Jun 14, 2026Updated last month
canopyai / Orpheus-TTS
View on GitHub
Towards Human-Sounding Speech
☆6,258Dec 5, 2025Updated 7 months ago
PasiKoodaa / ACE-Step-RADIO
View on GitHub
ACE-Step: A Step Towards Music Generation Foundation Model
☆50May 20, 2025Updated last year
rsxdalv / chatterbox
View on GitHub
SoTA open-source TTS
☆165Dec 16, 2025Updated 7 months ago
smy20011 / MorningRadio
View on GitHub
Generate Your Own Private Morning Radio for Commute
☆33Feb 5, 2025Updated last year
arkaprovob / litellm-hf-local
View on GitHub
A custom LiteLLM provider enabling local execution of Hugging Face models with streaming, quantization, and async support
☆30Jun 22, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
thad0ctor / llama-server-launcher
View on GitHub
Llama Server Launcher (llama.cpp/ik_llama) GUI
☆123Updated this week
remsky / Kokoro-FastAPI
View on GitHub
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/multiplatform CPU, AMD, NVIDIA GPU PyTorch support, handling, and auto-s…
☆5,248Updated this week
freddyaboulton / orpheus-cpp
View on GitHub
Fast Streaming TTS with Orpheus + WebRTC (with FastRTC)
☆353Apr 10, 2025Updated last year
hyperfocAIs / Attend
View on GitHub
Attend - to what matters.
☆17Feb 22, 2025Updated last year
phildougherty / qwen2.5_omni_chat
View on GitHub
Service for testing out the new Qwen2.5 omni model
☆62Apr 30, 2025Updated last year
dkruyt / webollama
View on GitHub
A sleek web interface for Ollama, making local LLM management and usage simple. WebOllama provides an intuitive UI to manage Ollama model…
☆81Oct 8, 2025Updated 9 months ago
TesslateAI / TFrameX
View on GitHub
☆180Aug 10, 2025Updated 11 months ago