tangkk / audiomod
audiomod is a project for audio modifications, including audio manipulators such as time-stretching, pitch-shifing, formant-changing, and audio filters such as vibrato, tremolo, ring-modulation, compression, reverb, equalizer, etc. It is a good starting point for audio signal processing developpers to build more advanced applications, or for st…
☆3Updated 9 months ago
Alternatives and similar repositories for audiomod:
Users that are interested in audiomod are comparing it to the libraries listed below
- Official implementation of Self-Remixing☆13Updated last year
- real-time speech enhance☆15Updated last year
- ☆25Updated last year
- A python algorithm to change the pitch of the voice in real time☆13Updated 4 years ago
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation☆33Updated last year
- Spherical residual vector quantization (SRVQ)☆28Updated 8 months ago
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was accepted on EUSIPCO2022☆15Updated 2 years ago
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆11Updated 9 months ago
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"☆33Updated 3 years ago
- Audio Super-Resolution using Deep Learning☆8Updated 2 years ago
- Streaming source separation for music and speech files, using the Open-Unmix LSTM architecture.☆19Updated 2 years ago
- An implementation of "Towards Improving Harmonic Sensitivity and Prediction Stability for Singing Melody Extraction", in ISMIR 2023☆23Updated last year
- with alignment learning and continuous wavelet transform☆20Updated 2 years ago
- The implementation of "Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Real-Time Full-Band Speech Enhancement"☆52Updated 2 years ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆24Updated 2 years ago
- GlottDNN vocoder and tools for training DNN excitation models☆32Updated 4 years ago
- ☆10Updated 6 months ago
- 60k hours of phoneme-aligned audio from audio books☆18Updated 9 months ago
- ☆13Updated last year
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Updated last year
- TAPE: An End-to-End Timbre-Aware Pitch Estimator☆22Updated last year
- Landing Page for Divide and Remaster v3☆17Updated 9 months ago
- Zero-Shot Blind Audio Bandwidth Extension☆21Updated last year
- Pytorch implementation of the invertible CQT based on Non-stationary Gabor filters☆30Updated last year
- real-time speech enhance skip-dpcrn-base using C++☆23Updated 2 years ago
- ☆21Updated last year
- ☆17Updated 3 years ago
- Multi-Resolution Neural Networks☆15Updated 2 weeks ago
- Voice conversion model for real-time speech synthesis using PPG (Phonetic PosteriorGram) as an intermediate feature, written in Pytorch.☆28Updated 3 years ago
- Official implementation of DualCycleGAN for nonparallel audio super resolution☆53Updated 2 years ago