elevenlabs / opuspyLinks
Opus codec support for Python.
☆31Updated 3 years ago
Alternatives and similar repositories for opuspy
Users that are interested in opuspy are comparing it to the libraries listed below
Sorting:
- Google's SoundStorm: Efficient Parallel Audio Generation☆131Updated 2 years ago
- VoiceBox neural network implementation☆110Updated last year
- ☆275Updated last year
- A package for NeuCodec: a 50hz, 0.8kbps, 24kHz audio codec.☆138Updated 3 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆69Updated 2 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆104Updated last year
- Trying to build an all in one speech-text language model - a bit like GPT-4o☆22Updated last year
- StyleTTS 2 Optimized Training Fork☆33Updated 11 months ago
- Monotonic Alignment Search☆100Updated 7 months ago
- VoiceLDM: Text-to-Speech with Environmental Context☆190Updated last year
- VALL-E 2 reproduction☆133Updated last year
- ☆61Updated 2 years ago
- ☆62Updated last year
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆22Updated last year
- Codebase and project page for EDMSound☆35Updated 2 years ago
- Official Implementation of StyleTTS-VC☆194Updated 11 months ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated 2 years ago
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training☆145Updated 3 years ago
- List of Podcast Feeds using iTunes API and script to download 6,000,000~ hours of English speech.☆31Updated 2 years ago
- Audiogen Codec☆144Updated last year
- (WIP) A retrain of F5-TTS on permissively-licensed data☆13Updated 9 months ago
- Open TTS models, built for streaming on the edge☆44Updated 9 months ago
- Supervoice diffusion enhance☆28Updated last year
- High quality text-to-speech based on StyleTTS 2.☆71Updated 3 weeks ago
- A sequence-to-sequence voice conversion toolkit.☆106Updated last year
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.☆113Updated last year
- Unsupervised Rhythm Modeling for Voice Conversion☆84Updated 2 years ago
- Create training data for training a voice cloner for bark text to speech.☆48Updated 2 years ago
- Your one-stop solution for voice dataset creation☆128Updated 2 years ago
- An unofficial PyTorch implementation of VALL-E☆88Updated 5 months ago