playht / pyhtLinks
PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning API
☆216Updated last month
Alternatives and similar repositories for pyht
Users that are interested in pyht are comparing it to the libraries listed below
Sorting:
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆136Updated last year
- Talk to GPT-4 and create a story together.☆91Updated last year
- Joint speech-language model - respond directly to audio!☆372Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆97Updated last year
- ⚙️ Zero-Shot Autonomous Robots☆116Updated last year
- TTS with The Massively Multilingual Speech (MMS) project☆235Updated last year
- ☆37Updated last year
- Low latency ai companion voice talk in 60 lines of code using faster_whisper and elevenlabs input streaming☆305Updated 3 months ago
- Transcription with speaker diarization pipeline☆94Updated 2 years ago
- ☆277Updated last year
- ☆89Updated last year
- ☆99Updated last year
- ☆175Updated last year
- Cog wrapper for Coqui / xtts-v2☆78Updated 9 months ago
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote☆223Updated 7 months ago
- Demo of AI chatbot that predicts user message to generate response quickly.☆104Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction☆100Updated 3 months ago
- faster-whisper as serverless endpoint☆117Updated 4 months ago
- CLAIRe: Conversational Learning AI with Recall☆67Updated 2 years ago
- Lightweight GPT-4 Vision processing over the Webcam☆286Updated last year
- The official Cartesia client for Python.☆104Updated last week
- Self-hosted AI voice agent☆114Updated last year
- ☆202Updated last year
- Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes s…☆52Updated last year
- A reproduction of the Gemini demo using GPT-vision.☆127Updated last year
- Shush is an app that deploys a WhisperV3 model with Flash Attention v2 on Modal and makes requests to it via a NextJS app☆216Updated last year
- Deepgram Conversational AI demo☆401Updated last month
- Use ChatGPT over Twilio to create an AI phone agent (works for incoming or outgoing calls).☆113Updated last year
- A playful script to get two AI assistants to converse using OpenAI Assistants API☆200Updated last year
- ☆343Updated last year