elevenlabs / opuspyLinks
Opus codec support for Python.
☆29Updated 2 years ago
Alternatives and similar repositories for opuspy
Users that are interested in opuspy are comparing it to the libraries listed below
Sorting:
- VoiceBox neural network implementation☆109Updated last year
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆21Updated 7 months ago
- Google's SoundStorm: Efficient Parallel Audio Generation☆132Updated 2 years ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆102Updated 10 months ago
- SelfRemaster: SSL Speech Restoration☆89Updated last year
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆31Updated 2 years ago
- Object-oriented handling of audio data, with GPU-powered augmentations, and more.☆286Updated 4 months ago
- Unofficial implementation of wavenext vocoder☆48Updated 11 months ago
- StyleTTS 2 Optimized Training Fork☆33Updated 6 months ago
- ☆63Updated last year
- Speaker change detection using SincNet and an LSTM/Transformer☆53Updated 2 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆66Updated 2 weeks ago
- logWMSE, an audio quality metric & loss function with support for digital silence target. Useful for training and evaluating audio source…☆41Updated 3 months ago
- AudioSR-Upsampling (any -> 48kHz)☆41Updated last year
- An unofficial PyTorch implementation of VALL-E☆87Updated last week
- High quality text-to-speech based on StyleTTS 2.☆59Updated this week
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…☆32Updated 2 years ago
- StyleTTS2 + Vocos as a Decoder☆13Updated 4 months ago
- Audio samples accompanying publications related to DF-Conformer, a speech enhancement model.☆31Updated 2 months ago
- Audiogen Codec☆143Updated last year
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆50Updated 4 months ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆70Updated 2 years ago
- ☆60Updated last year
- IPA Phonemizer/Dephonemizer for 139 human languages☆31Updated last week
- Export an ONNX graph that performs ISTFT. Designed for TTS models.☆25Updated last year
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training☆137Updated 2 years ago
- Official implementation for FlowSep☆58Updated 7 months ago
- Heteronym to Phoneme Parser☆18Updated last year
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆15Updated last year
- Supervoice diffusion enhance☆27Updated last year