elevenlabs / elevenlabs-pythonLinks
The official Python SDK for the ElevenLabs API.
β2,694Updated this week
Alternatives and similar repositories for elevenlabs-python
Users that are interested in elevenlabs-python are comparing it to the libraries listed below
Sorting:
- π Text-prompted Generative Audio Model - With the ability to clone voicesβ3,322Updated last week
- Foundational model for human-like, expressive TTSβ4,155Updated last year
- Controllable and fast Text-to-Speech for over 7000 languages!β1,635Updated 2 months ago
- Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorchβ1,528Updated 4 months ago
- A multi-voice TTS system trained with an emphasis on qualityβ14,536Updated 9 months ago
- Multilingual Automatic Speech Recognition with word-level timestamps and confidenceβ2,570Updated 5 months ago
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Modelsβ5,928Updated last year
- Unified-Modal Speech-Text Pre-Training for Spoken Language Processingβ1,395Updated last year
- Converts text to speech in realtimeβ3,424Updated last month
- Fast TorToiSe inference (5x or your money back!)β825Updated last year
- Project that allows one to use a microphone with OpenAI whisper.β776Updated last year
- JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.β4,631Updated last year
- Inference and training library for high-quality TTS models.β5,399Updated 8 months ago
- AI powered speech denoising and enhancementβ1,939Updated 8 months ago
- An unofficial PyTorch implementation of the audio LM VALL-Eβ2,984Updated 2 years ago
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.β3,933Updated 7 months ago
- Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorchβ2,579Updated 7 months ago
- YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyoneβ1,018Updated 9 months ago
- The code for the bark-voicecloning model. Training and inference.β704Updated last year
- Foundational Models for State-of-the-Art Speech and Text Translationβ11,647Updated 9 months ago
- Implementation of Meta-Voicebox : The first generative AI model for speech to generalize across tasks with state-of-the-art performance.β584Updated 2 years ago
- An Open Source text-to-speech system built by inverting Whisper.β4,347Updated 2 months ago
- MARS5 speech model (TTS) from CAMB.AIβ2,793Updated last year
- π BARK INFINITY GUI CMD πΆ Powered Up Bark Text-prompted Generative Audio Modelβ1,011Updated last year
- π€ Build voice-based LLM agents. Modular + open source.β3,424Updated 9 months ago
- Python client for Replicateβ859Updated this week
- Text-to-Audio/Music Generationβ2,484Updated 11 months ago
- β1,134Updated 6 months ago
- A webui for different audio related Neural Networksβ1,198Updated 3 months ago
- Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speakerβ¦β8,165Updated this week