tangkk / audiomodLinks
audiomod is a project for audio modifications, including audio manipulators such as time-stretching, pitch-shifing, formant-changing, and audio filters such as vibrato, tremolo, ring-modulation, compression, reverb, equalizer, etc. It is a good starting point for audio signal processing developpers to build more advanced applications, or for st…
☆3Updated 11 months ago
Alternatives and similar repositories for audiomod
Users that are interested in audiomod are comparing it to the libraries listed below
Sorting:
- Official implementation of Self-Remixing☆14Updated last year
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆12Updated 11 months ago
- ☆13Updated last year
- Speech enhancement in noisy and reverberant environments using deep neural networks☆21Updated 3 weeks ago
- Causal Speech Enhancement Based on a Two-Branch Nested U-Net Architecture Using Self-Supervised Speech Embeddings☆15Updated last month
- A python algorithm to change the pitch of the voice in real time☆13Updated 4 years ago
- ☆10Updated 8 months ago
- ☆27Updated last year
- ☆10Updated 2 years ago
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation☆34Updated last year
- A purely header only c version of hifi-gan☆9Updated 3 years ago
- Phonemes and durations labeling based on whisper small☆11Updated last year
- Streaming source separation for music and speech files, using the Open-Unmix LSTM architecture.☆19Updated 2 years ago
- ☆18Updated 3 years ago
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"☆33Updated 3 years ago
- Pytorch implementation of the invertible CQT based on Non-stationary Gabor filters☆30Updated 2 years ago
- ☆12Updated 2 years ago
- Multi-Resolution Neural Networks☆16Updated last week
- A solution to denoising and separating for two-speaker-mixed noisy speech, using a BSRNN inspired network.☆13Updated last year
- Audio Super-Resolution using Deep Learning☆8Updated 2 years ago
- real-time speech enhance☆16Updated last year
- A small tool to calculate the distribution of audio durations in a directory☆14Updated 2 years ago
- Spherical residual vector quantization (SRVQ)☆30Updated 10 months ago
- The source code for the paper CrossSinger (asru2023)☆18Updated last year
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …☆21Updated last week
- Attention-Enhanced Short-Time Wiener Solution for Acoustic Echo Cancellation☆16Updated last week
- Reimplementation of Miipher☆22Updated last year
- 60k hours of phoneme-aligned audio from audio books☆18Updated 11 months ago
- Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation☆13Updated 11 months ago
- Audio samples for the paper "TinyLSTMs: Efficient Neural Speech Enhancement for Hearing Aids"☆43Updated 5 years ago