nomonosound / fast-align-audio
A fast python library for aligning similar audio snippets passed in as NumPy arrays
☆42Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for fast-align-audio
- logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even whe…☆33Updated 2 months ago
- PodcastMix A dataset for separating music and speech in podcasts.☆43Updated 3 months ago
- Landing Page for All Things Source Separation☆17Updated 2 weeks ago
- Frechet Audio Distance evaluation in PyTorch☆35Updated last year
- Project for MIDI to Audio Synthesis☆22Updated last year
- ☆61Updated 7 months ago
- ☆40Updated 5 months ago
- SDX23 startkit for the Demucs baselines.☆24Updated last year
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆42Updated 2 months ago
- TAPE: An End-to-End Timbre-Aware Pitch Estimator☆20Updated 11 months ago
- Supplementary Materials of ISMIR 2022 paper "Analysis and detection of singing techniques in repertoires of J-POP solo singers" by Yuya Y…☆19Updated 7 months ago
- ☆21Updated 7 months ago
- ☆49Updated last year
- A C++/Cython audio limiter for Python.☆23Updated last year
- pyMUSHRA is a python web application which hosts webMUSHRA experiments and collects the data with python.☆36Updated last year
- Source Separation training codebase for the Sound Demixing Challenge 2023.☆38Updated last year
- Prosody and Pronunciation Modification Network☆44Updated 3 months ago
- ☆41Updated last year
- An invertible and differentiable implementation of the Constant-Q Transform (CQT).☆54Updated last year
- Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutiv…☆42Updated 8 months ago
- A Python Library for Fundamental Frequency Estimation in Music Recordings☆45Updated last year
- logWMSE, an audio quality metric & loss function with support for digital silence target. Useful for training and evaluating audio source…☆28Updated 3 months ago
- Banquet: A Stem-Agnostic Single-Decoder System for Music Source Separation Beyond Four Stems☆37Updated this week
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆30Updated last year
- ☆43Updated 3 weeks ago
- Implementation for "Music Enhancement via Image Translation and Vocoding"☆52Updated 2 years ago
- Reproducible Subjective Evaluation☆57Updated 8 months ago
- ☆29Updated last year
- 60k hours of phoneme-aligned audio from audio books☆18Updated 3 months ago