mallahyari / RealtimeSTT-TTSLinks
A library for real-time Speech to Text (STT), and Text to Speech (TTS) capability
☆40Updated last year
Alternatives and similar repositories for RealtimeSTT-TTS
Users that are interested in RealtimeSTT-TTS are comparing it to the libraries listed below
Sorting:
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆134Updated last year
- Self-hosted AI voice agent☆111Updated 10 months ago
- An JS web client for connecting to Pipecat bots with voice and vision☆45Updated 6 months ago
- ☆89Updated last year
- Agent with vision ability via llava & autogen☆73Updated last year
- Talking head video AI generator☆78Updated last year
- ☆68Updated last year
- A general purpose AI voice assistant built using GPT-4.☆33Updated last year
- Get started using Deepgram's Live Transcription with this Flask demo app☆35Updated this week
- Low latency ai companion voice talk in 60 lines of code using faster_whisper and elevenlabs input streaming☆295Updated last month
- 🤖 Sam-assistant is a personal assistant that is designed to understand your documents, search the internet, and in future versions, crea…☆49Updated last year
- Here is a collection of cool applications that I've built with AssemblyAI☆36Updated 10 months ago
- A basic voice agent built with Python agents framework☆50Updated 2 months ago
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)☆86Updated last year
- AI Agents with Google's Gemini Pro and Gemini Pro Vision Models☆28Updated last year
- Simple Chainlit UI for running llms locally using Ollama and LangChain☆119Updated 11 months ago
- A full stack app for interruptible, low-latency and near-human quality AI phone calls built from stitching LLMs, speech understanding too…☆138Updated 2 months ago
- LLM Siri with OpenAI, Perplexity, Ollama, Llama2, Mistral, Mixtral & Langchain☆60Updated last year
- Build Phone Calling Voice Agent fully powered by open source models.☆50Updated 2 months ago
- AI Voice Assistant: Talk to an AI agent that helps you with event scheduling, contact management, accessing your knowledge base, and web …☆51Updated 7 months ago
- Agent Studio is an AI agent application designed to handle real-time interactions through phone calls, web-based voice user interfaces (V…☆39Updated 8 months ago
- Real-time Speech To Text using Faster Whisper.☆57Updated 11 months ago
- AI Voice Assistant project☆40Updated 3 months ago
- Talk to GPT-4 and create a story together.☆91Updated last year
- Speech To Speech: an effort for an open-sourced and modular GPT4-o☆64Updated 9 months ago
- An intellligent AI assistant that can do anything!☆54Updated last year
- AURORA (Artificial Unified Responsive Optimized Reasoning Agent) uses lobes and web research for RAG based memory and learning.☆17Updated 8 months ago
- Multimodal AI App using Llava 7B and Gradio.☆38Updated last year
- Real time audio to audio translation over sockets. With virtual microphones, you can use this in any video conferencing software you'd li…☆42Updated 11 months ago
- RealVoiceGPT is a web application that lets you have voice conversations with ChatGPT. The project uses ElevenLabs AI text to speech to g…☆31Updated 2 years ago