tangkk / audiomod
audiomod is a project for audio modifications, including audio manipulators such as time-stretching, pitch-shifing, formant-changing, and audio filters such as vibrato, tremolo, ring-modulation, compression, reverb, equalizer, etc. It is a good starting point for audio signal processing developpers to build more advanced applications, or for st…
☆3Updated 8 months ago
Alternatives and similar repositories for audiomod:
Users that are interested in audiomod are comparing it to the libraries listed below
- Official implementation of Self-Remixing☆13Updated last year
- real-time speech enhance☆14Updated last year
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆11Updated 8 months ago
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was accepted on EUSIPCO2022☆15Updated 2 years ago
- ☆23Updated 11 months ago
- An implementation of "Towards Improving Harmonic Sensitivity and Prediction Stability for Singing Melody Extraction", in ISMIR 2023☆22Updated last year
- ☆11Updated 2 years ago
- A python algorithm to change the pitch of the voice in real time☆13Updated 4 years ago
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"☆33Updated 3 years ago
- Spherical residual vector quantization (SRVQ)☆28Updated 7 months ago
- real-time speech enhance skip-dpcrn-base using C++☆22Updated 2 years ago
- The implementation of "Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Real-Time Full-Band Speech Enhancement"☆52Updated 2 years ago
- 60k hours of phoneme-aligned audio from audio books☆18Updated 8 months ago
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation☆33Updated last year
- GlottDNN vocoder and tools for training DNN excitation models☆32Updated 4 years ago
- with alignment learning and continuous wavelet transform☆20Updated 2 years ago
- ☆17Updated 3 years ago
- Audio Super-Resolution using Deep Learning☆8Updated 2 years ago
- Reimplementation of Miipher☆20Updated last year
- Audio samples for the paper "TinyLSTMs: Efficient Neural Speech Enhancement for Hearing Aids"☆42Updated 4 years ago
- Pytorch implementation of the invertible CQT based on Non-stationary Gabor filters☆30Updated last year
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆24Updated 2 years ago
- Paderbox: A collection of utilities for audio / speech processing☆38Updated last month
- Mutiband version of HIFIGAN☆18Updated 4 years ago
- Based on https://github.com/fatchord/WaveRNN☆24Updated 4 years ago
- Zero-Shot Blind Audio Bandwidth Extension☆21Updated last year
- Speech enhancement in noisy and reverberant environments using deep neural networks☆19Updated 2 weeks ago
- A toy-like Text-to-Speech for Chinese/Mandarin synthesize, inspired by Tacotron & FastSpeech2 & RefineGAN.☆15Updated 2 years ago
- The source code for the paper CrossSinger (asru2023)☆18Updated last year
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆17Updated 5 months ago