waveplate / openduplexLinks
openduplex uses speech-to-text, artificial intelligence and text-to-speech, to call businesses and make appointments for you
☆36Updated 2 years ago
Alternatives and similar repositories for openduplex
Users that are interested in openduplex are comparing it to the libraries listed below
Sorting:
- This project provides a Flask-based API for generating high-quality text-to-speech (TTS) audio using F5-TTS, a flexible and powerful TTS …☆14Updated 2 months ago
- an improved version of Real-time-voice-cloning☆50Updated last year
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)☆71Updated 2 years ago
- Open Server is an OpenAI API Compatible Server for generating text, images, embeddings, and storing them in vector databases. It also inc…☆17Updated last year
- Talk with ChatGPT using your VOICE☆123Updated last year
- ☆29Updated last month
- Transcription and annotation interface for recorded audio or video files☆41Updated this week
- A tool for making videos from PDF presentations.☆31Updated 4 years ago
- ☆17Updated 2 years ago
- Add real-time Speech-to-Text to your LiveKit application with AssemblyAI☆17Updated 4 months ago
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆26Updated 2 years ago
- Site for sharing MusicGen + AudioGen Prompts and Creations☆46Updated 6 months ago
- A Full-Duplex Open-Domain Dialogue Agent with Continuous Turn-Taking Behavior☆33Updated last year
- 🤖 Quantum-powered excuse generator for developers. Blame bugs on cosmic rays, AI sentience, or Schrödinger’s intern.☆26Updated last month
- This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script utilizes custom datasets and use CUDA for …☆12Updated last year
- ☆74Updated last year
- XTTS: Multilingual Voice Cloning TTS Model by Coqui Deployed to Replicate☆25Updated 2 years ago
- SEPIA server to support open-source speech recognition via WebSocket connection.☆131Updated 11 months ago
- A simple TTS server for generating speech using StyleTTS2☆37Updated last year
- Reproducing "Writing with Transformer" demo, using aitextgen/FastAPI in backend, Quill/React in frontend☆27Updated 4 years ago
- Effortlessly record, transcribe, and summarize meetings with this user-friendly desktop utility powered by OpenAI's Whisper and GPT-3.5-t…☆187Updated 2 years ago
- AI Talking Head: create video from plain text or audio file in minutes, support up to 100+ languages and 350+ voice models.☆36Updated 2 years ago
- canvas-based talking head model using viseme data☆32Updated 2 years ago
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.☆80Updated 2 weeks ago
- 🔊😊 A fastapi voice-assistant framework to quickly prototype LLM-powered voice assistants in <5 minutes.☆29Updated last year
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation☆42Updated 2 years ago
- A minimalistic streamlit chatbot UI to combine and customize tools for langchain llm agents☆12Updated last year
- DeepFloyd IF web UI☆29Updated 2 years ago
- streaming speech to text server using Whisper☆95Updated 2 years ago
- Using FastChat-T5 Large Language Model, Vosk API for automatic speech recognition, and Piper for text-to-speech☆125Updated 2 years ago