elevenlabs / opuspy
Opus codec support for Python.
☆25Updated last year
Related projects: ⓘ
- Simple PyTorch Denoisers for Waveform Audio☆31Updated 4 months ago
- Audiogen Codec☆116Updated 2 months ago
- Unofficial implementation of wavenext vocoder☆28Updated 3 weeks ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆74Updated 2 months ago
- The TTSDS benchmark evaluates synthetic speech quality by considering prosody, speaker identity, and intelligibility, comparing these fac…☆14Updated 3 weeks ago
- [DEPRECIATED] Very fast, large music transformer with 8k sequence length, efficient heptabit MIDI notes encoding, true full MIDI instrume…☆15Updated 9 months ago
- Audio generation using diffusion models, in PyTorch.☆44Updated 11 months ago
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆20Updated 4 months ago
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆28Updated last year
- ☆26Updated last year
- AudioSR-Upsampling (any -> 48kHz)☆38Updated 7 months ago
- A collection of useful audio datasets and transforms for PyTorch.☆130Updated last year
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆41Updated last week
- VoiceBox neural network implementation☆88Updated last month
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models☆65Updated last year
- Code for Investigating Personalization Methods in Text to Music Generation☆29Updated 5 months ago
- Codebase and project page for EDMSound☆29Updated 10 months ago
- Audio samples accompanying publications related to DF-Conformer, a speech enhancement model.☆20Updated last year
- VALL-E 2 reproduction☆72Updated 2 months ago
- NSNet2 Deep Noise Suppression (DNS) package☆29Updated 2 years ago
- A fast python library for aligning similar audio snippets passed in as NumPy arrays☆40Updated last month
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆24Updated 2 years ago
- A toolkit for processing speech data and creating speech datasets☆75Updated last week
- Streamlit app to visualize and edit TTS datasets☆14Updated 2 years ago
- Adaptive Vocoder for Custom Voice☆58Updated last year
- Demos of Essentia models hosted on Replicate.com☆38Updated 3 months ago
- Open source Python program for automating gain staging. part 1 of a series for automating audio processing tasks, end goal is to create a…☆30Updated 11 months ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated 7 months ago
- SDX23 startkit for the Demucs baselines.☆23Updated last year
- Scripts for computing the Intelligibility and CLVP scores for evaluating TTS models☆135Updated 9 months ago