elevenlabs / elevenlabs-python
The official Python API for ElevenLabs Text to Speech.
★2,510 · Updated this week
Alternatives and similar repositories for elevenlabs-python:
Users who are interested in elevenlabs-python are comparing it to the libraries listed below.
- Implementation of SoundStorm, Efficient Parallel Audio Generation from Google DeepMind, in PyTorch ★1,497 · Updated 2 weeks ago
- Text-prompted Generative Audio Model - With the ability to clone voices ★3,290 · Updated 10 months ago
- Unified-Modal Speech-Text Pre-Training for Spoken Language Processing ★1,347 · Updated last year
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate. ★3,846 · Updated 4 months ago
- JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU. ★4,593 · Updated last year
- Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in PyTorch ★2,536 · Updated 3 months ago
- Fast TorToiSe inference (5x or your money back!) ★809 · Updated 10 months ago
- BARK INFINITY GUI CMD: Powered Up Bark Text-prompted Generative Audio Model ★1,007 · Updated last year
- Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker… ★7,453 · Updated this week
- ★1,127 · Updated 2 months ago
- Multilingual Automatic Speech Recognition with word-level timestamps and confidence ★2,390 · Updated last month
- MARS5 speech model (TTS) from CAMB.AI ★2,753 · Updated 9 months ago
- A webui for different audio related Neural Networks ★1,159 · Updated 8 months ago
- Foundational model for human-like, expressive TTS ★4,106 · Updated 9 months ago
- Controllable and fast Text-to-Speech for over 7000 languages! ★1,583 · Updated 6 months ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available at https://plachtaa.github.io/vallex/ ★7,864 · Updated last year
- PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech). Reproduced demo: https://lifeiteng.github.io/valle/index.html ★2,122 · Updated last year
- Converts text to speech in realtime ★2,983 · Updated last week
- An Open Source text-to-speech system built by inverting Whisper. ★4,234 · Updated last month
- Inference and training library for high-quality TTS models. ★5,229 · Updated 5 months ago
- YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone ★969 · Updated 6 months ago
- A python package to build AI-powered real-time audio applications ★1,278 · Updated 2 months ago
- An unofficial PyTorch implementation of the audio LM VALL-E ★2,990 · Updated 2 years ago
- Foundational Models for State-of-the-Art Speech and Text Translation ★11,509 · Updated 5 months ago
- ★8,341 · Updated 10 months ago
- A family of diffusion models for text-to-audio generation. ★1,163 · Updated 4 months ago
- Cross-Platform, GPU Accelerated Whisper ★1,797 · Updated last year
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide seamless conversations with an AI. ★1,599 · Updated 9 months ago
- Implementation of Meta-Voicebox: The first generative AI model for speech to generalize across tasks with state-of-the-art performance. ★581 · Updated last year
- ★586 · Updated last year