PromtEngineer / Verbi
A modular voice assistant application for experimenting with state-of-the-art transcription, response generation, and text-to-speech models. Supports OpenAI, Groq, ElevenLabs, CartesiaAI, and Deepgram APIs, plus local models via Ollama. Ideal for research and development in voice technology.
☆970Updated 2 weeks ago
Alternatives and similar repositories for Verbi:
Users who are interested in Verbi are comparing it to the libraries listed below.
- Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with C…☆628Updated 8 months ago
- ☆326Updated 8 months ago
- Command Your World with Voice☆659Updated 4 months ago
- ☆368Updated last year
- AutoGroq is a groundbreaking tool that revolutionizes the way users interact with Autogen™ and other AI assistants. By dynamically genera…☆1,441Updated 4 months ago
- Sharing early versions of Ada, a personal AI Assistant built on OpenAI's Realtime API☆695Updated 6 months ago
- Multi-modal conversational AI (xRx) system☆301Updated 3 months ago
- Low-code tool to rapidly build and coordinate multi-agent teams☆989Updated 3 months ago
- OpenAI compatible TTS for Sesame CSM:1b & dia:1.6b - Voice Cloning from File/YT☆316Updated last week
- Open source conversation framework and visual editor for structured Pipecat dialogues☆307Updated 2 weeks ago
- Local SRT/LLM/TTS Voicechat☆667Updated 6 months ago
- High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs.☆324Updated 2 weeks ago
- Generate imagined websites on an infinite canvas☆600Updated 10 months ago
- Interface for OuteTTS models.☆1,205Updated this week
- AlwaysReddy is an LLM voice assistant that is always just a hotkey away.☆736Updated 2 months ago
- openperplex is an open-source AI search engine☆855Updated 9 months ago
- End-to-end platform for building voice-first multimodal agents☆419Updated 6 months ago
- Agent Cloud is like having your own GPT builder with a bunch of extra goodies. The GUI features 1) RAG pipeline which can natively embed 260…☆613Updated 3 weeks ago
- Versatile agents for long running, research intensive tasks.☆399Updated 7 months ago
- A fast multimodal LLM for real-time voice☆3,896Updated 2 months ago
- An AI memory layer with short- and long-term storage, semantic clustering, and optional memory decay for context-aware applications.☆620Updated 3 months ago
- ☆863Updated last month
- Deepgram Conversational AI demo☆387Updated 3 weeks ago
- Convert any PDF into a podcast episode!☆2,245Updated 4 months ago
- Introducing the Assistant Swarm. An extension to the OpenAI Node SDK to automatically delegate work to any assistant you create in OpenAi…☆519Updated last year
- RESTai is an AIaaS (AI as a Service) open-source platform. Built on top of LlamaIndex & Langchain. Supports any public LLM supported by L…☆419Updated last week
- first base model for full-duplex conversational audio☆1,737Updated 4 months ago
- ☆1,126Updated 2 months ago
- Local realtime voice AI☆2,284Updated 2 months ago
- Sesame CSM 1B Voice Cloning☆287Updated last month