waveplate / openduplexLinks
openduplex uses speech-to-text, artificial intelligence and text-to-speech, to call businesses and make appointments for you
☆36Updated 2 years ago
Alternatives and similar repositories for openduplex
Users that are interested in openduplex are comparing it to the libraries listed below
Sorting:
- This project provides a Flask-based API for generating high-quality text-to-speech (TTS) audio using F5-TTS, a flexible and powerful TTS …☆14Updated 5 months ago
- This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script utilizes custom datasets and use CUDA for …☆13Updated last year
- an improved version of Real-time-voice-cloning☆52Updated last year
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆27Updated 2 years ago
- Add real-time Speech-to-Text to your LiveKit application with AssemblyAI☆18Updated 7 months ago
- canvas-based talking head model using viseme data☆32Updated 2 years ago
- Ai generated music video with Riffusion and Gradio☆22Updated 3 years ago
- Automatically generate a lip-synced avatar based off of a transcript and audio☆14Updated 2 years ago
- Transcription and annotation interface for recorded audio or video files☆50Updated this week
- 🔊😊 A fastapi voice-assistant framework to quickly prototype LLM-powered voice assistants in <5 minutes.☆31Updated 2 years ago
- An JS web client for connecting to Pipecat bots with voice and vision☆45Updated last year
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)☆73Updated 2 years ago
- Site for sharing MusicGen + AudioGen Prompts and Creations☆49Updated 10 months ago
- All public LiveKit repos as a common repo to make searching and LLM inference easier.☆26Updated last month
- Creates video from TTS output and viseme images.☆15Updated 3 years ago
- 🤖 Quantum-powered excuse generator for developers. Blame bugs on cosmic rays, AI sentience, or Schrödinger’s intern.☆28Updated 5 months ago
- Open Server is an OpenAI API Compatible Server for generating text, images, embeddings, and storing them in vector databases. It also inc…☆17Updated 2 years ago
- A Full-Duplex Open-Domain Dialogue Agent with Continuous Turn-Taking Behavior☆36Updated 2 years ago
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.☆87Updated last month
- A tool to extend camelai's plans and thoughts to browser-use web automation☆12Updated 10 months ago
- ☆18Updated 2 years ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆68Updated 2 years ago
- Gradio_demo.py with Blinking on Still Mode Video Creation☆12Updated 2 years ago
- Your customized AI assistant - Personal assistants on any hardware! With llama.cpp, whisper.cpp, ggml, LLaMA-v2.☆120Updated 2 years ago
- AI Lip Syncing application, deployed on Streamlit☆43Updated last year
- XTTS: Multilingual Voice Cloning TTS Model by Coqui Deployed to Replicate☆25Updated 2 years ago
- Using a single image and just 10 seconds of sample audio, our project enables you to create a video where it appears as if you're speakin…☆40Updated 2 years ago
- text-to-audio-latent-diffusion☆37Updated 2 years ago
- Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.☆16Updated last year
- ☆75Updated last year