playht / pyht
PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning API
☆201Updated this week
Alternatives and similar repositories for pyht:
Users that are interested in pyht are comparing it to the libraries listed below
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆126Updated 9 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆113Updated last year
- ☆37Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction☆92Updated 11 months ago
- AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more☆288Updated 8 months ago
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote☆196Updated last month
- Talk to GPT-4 and create a story together.☆88Updated last year
- Low latency ai companion voice talk in 60 lines of code using faster_whisper and elevenlabs input streaming☆277Updated 9 months ago
- ☆173Updated last year
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆156Updated 8 months ago
- A playful script to get two AI assistants to converse using OpenAI Assistants API☆196Updated last year
- ☆89Updated last year
- Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes s…☆52Updated 11 months ago
- AITuber Server☆152Updated 4 months ago
- ☆365Updated 11 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆94Updated 10 months ago
- This code implements a Local LLM Selector from the list of Local Installed Ollama LLMs for your specific user Query☆102Updated last year
- Like ChatGPT's voice conversations with an AI, but entirely offline/private/trade-secret-friendly, using local AI models such as LLama 2 …☆156Updated 7 months ago
- Daily Bots Web Demo showcasing how to build real-time voice AI agents☆228Updated 5 months ago
- Site for sharing Bark voices☆50Updated last week
- 🐮📢 The first AI voice assistant that interrupts *you*☆140Updated 6 months ago
- 🔓 The open-source autonomous agent LLM initiative 🔓☆91Updated last year
- Groq-Powered Real-Time Voice Assistant☆212Updated 4 months ago
- ☆86Updated last year
- Scripts to create your own moe models using mlx☆89Updated last year
- [SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild☆59Updated last year
- Turn text from websites into spoken audio with edge-tts, F5, etc. and save as mp3 files☆45Updated last month
- Example of calling OpenRouter from a Streamit app☆94Updated last year
- Chat Bot with LLM and Fact Reference. RAG(Retrieval Augmented Generation) and LangChain backed☆128Updated 10 months ago
- Question-answering chatbot using OpenAI's GPT-3.5-turbo model, DeepLake for the vector database, and the Whisper API for voice transcript…☆163Updated 10 months ago