ws-choi / AMSS-Net
A PyTorch implementation of the paper: "AMSS-Net: Audio Manipulation on User-Specified Sources with Textual Queries" (ACM Multimedia 2021)
☆21Updated 3 years ago
Alternatives and similar repositories for AMSS-Net
Users who are interested in AMSS-Net are comparing it to the repositories listed below.
- Addressing the confounds of accompaniments in singer identification☆18Updated 5 years ago
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"☆33Updated 3 years ago
- ☆18Updated 5 years ago
- Unsupervised Representation Learning for Singing Voice Separation☆22Updated 2 years ago
- Simple baseline model for the HEAR benchmark☆23Updated this week
- [ismir2019] Learning a Joint Embedding Space of Monophonic and Mixed Music Signals for Singing Voice☆28Updated 2 years ago
- ☆15Updated 2 years ago
- Unofficial PyTorch dataset for Slakh☆9Updated 4 years ago
- ☆40Updated 5 years ago
- Implementation of CREPE Pitch tracker with PyTorch☆19Updated 5 years ago
- Music Demixing Challenge Submission Repo☆15Updated last year
- J-Net is aimed at audio separation with a randomly weighted encoder.☆12Updated 5 years ago
- Who calls the shots? Rethinking Few-Shot Learning for Audio (WASPAA 2021)☆43Updated 3 years ago
- Code for "A diffusion-inspired training strategy for singing voice extraction in the waveform domain" (ISMIR 2022)☆16Updated 2 years ago
- GlottDNN vocoder and tools for training DNN excitation models☆32Updated 4 years ago
- Frontend filterbank learning module with HVQT initialization capabilities.☆21Updated last year
- ☆32Updated 4 years ago
- Frechet Audio Distance evaluation in PyTorch☆36Updated 2 years ago
- DSing ASR task: Resources and Baseline for an unaccompanied singing ASR.☆19Updated 3 years ago
- Learning Complex Basis Functions for Invariant Signal Representations with the Complex Autoencoder☆38Updated 6 months ago
- ☆18Updated 3 years ago
- ☆22Updated 2 years ago
- This is a subset of the DALI set consisting of 240 polyphonic recordings that is used to benchmark lyrics transcription evaluation.☆12Updated 3 years ago
- PodcastMix: A dataset for separating music and speech in podcasts.☆43Updated 10 months ago
- Rough implementation of Simultaneous Separation and Transcription of Mixtures with Multiple Polyphonic and Percussive Instruments (Ethan …☆25Updated 4 years ago
- This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating r…☆12Updated 3 years ago
- A C++/Cython audio limiter for Python.☆25Updated 2 years ago
- ☆25Updated 7 years ago
- ☆18Updated 5 years ago
- STOI loss functions in PyTorch (mirror of https://github.com/mpariente/pytorch_stoi)☆15Updated 4 years ago