elevenlabs / opuspyLinks
Opus codec support for Python.
☆30Updated 2 years ago
Alternatives and similar repositories for opuspy
Users that are interested in opuspy are comparing it to the libraries listed below
Sorting:
- VoiceBox neural network implementation☆110Updated last year
- Google's SoundStorm: Efficient Parallel Audio Generation☆132Updated 2 years ago
- Object-oriented handling of audio data, with GPU-powered augmentations, and more.☆295Updated 5 months ago
- ☆273Updated last year
- Audiogen Codec☆144Updated last year
- A fast python library for aligning similar audio snippets passed in as NumPy arrays☆48Updated 3 weeks ago
- SelfRemaster: SSL Speech Restoration☆89Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆103Updated 10 months ago
- A collection of useful audio datasets and transforms for PyTorch.☆140Updated 2 years ago
- AudioStretchy is a Python wrapper around the `audio-stretch` C library, which performs fast, high-quality time-stretching of WAV/MP3 file…☆58Updated last month
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆31Updated 2 years ago
- Unofficial implementation JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation(https://arxiv.org/abs/2310.1…☆31Updated last year
- Speaker change detection using SincNet and an LSTM/Transformer☆53Updated 3 months ago
- Open TTS models, built for streaming on the edge☆42Updated 5 months ago
- Pytorch implementation of BigVSAN☆203Updated last year
- A package for NeuCodec: a 50hz, 0.8kbps, 24kHz audio codec.☆64Updated last week
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆21Updated 7 months ago
- Your one-stop solution for voice dataset creation☆123Updated last year
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training☆139Updated 3 years ago
- Pitch-shift audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.☆138Updated 11 months ago
- Simple PyTorch Denoisers for Waveform Audio☆35Updated 4 months ago
- Speaker Diarization with Transformers☆69Updated 3 months ago
- GOMIN; Gaudio Open Mel-spectrogram Inversion Network☆111Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆118Updated 2 years ago
- Putting flows on top of neural transducers for better TTS☆63Updated 3 weeks ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated 2 years ago
- A sequence-to-sequence voice conversion toolkit.☆102Updated last year
- ☆87Updated 11 months ago
- Create training data for training a voice cloner for bark text to speech.☆46Updated 2 years ago
- Trying to build an all in one speech-text language model - a bit like GPT-4o☆22Updated last year