playht / text-to-speech-api
Play.ht's Text to Speech API
☆86Updated 10 months ago
Alternatives and similar repositories for text-to-speech-api:
Users that are interested in text-to-speech-api are comparing it to the libraries listed below
- Browser-based Voice Assistant☆44Updated last year
- Speech to text to speech using Elevenlabs☆28Updated last year
- Convert your PDFs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and efficient…☆44Updated last week
- 🎧 | RunPod worker of the faster-whisper model for Serverless Endpoint.☆84Updated last week
- Transcription with speaker diarization pipeline☆90Updated last year
- Generate visual podcasts about novels using open source models☆25Updated 2 years ago
- ☆37Updated last year
- 📦 Metadata for all the public models on Replicate, bundled up into an npm package.☆27Updated this week
- RunPod Serverless Worker for Oobabooga Text Generation API for LLMs☆1Updated 9 months ago
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)☆65Updated last year
- XTTS: Multilingual Voice Cloning TTS Model by Coqui Deployed to Replicate☆24Updated last year
- ☆17Updated 2 years ago
- Audio datasets, easier.☆82Updated last year
- RealVoiceGPT is a web application that lets you have voice conversations with ChatGPT. The project uses ElevenLabs AI text to speech to g…☆29Updated last year
- Powered by OpenAI Whisper & Gradio☆30Updated 2 years ago
- ☆107Updated 9 months ago
- PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning API☆195Updated last week
- [SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild☆59Updated 11 months ago
- ☆26Updated last year
- Cog wrapper for Coqui / xtts-v2☆74Updated 2 months ago
- Site for sharing Bark voices☆48Updated 7 months ago
- Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes s…☆52Updated 9 months ago
- An auto generated wiki.☆21Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆92Updated 9 months ago
- An independent voice interface for Inflection AI's conversational assistant, Pi☆17Updated this week
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆57Updated last week
- ☆265Updated 8 months ago
- A simplistic UI connecting gpt-3 and stable diffusion☆15Updated 2 years ago
- Cog wrapper for collabora/WhisperSpeech☆25Updated 11 months ago
- ☆55Updated last year