pseeth / otoworld
Applying reinforcement learning to perform source separation.
☆21Updated 4 years ago
Alternatives and similar repositories for otoworld
Users that are interested in otoworld are comparing it to the libraries listed below
Sorting:
- Attacking Speaker Recognition with Deep Generative Models☆34Updated 2 years ago
- This repo contains code for comparing audio representation sin the task of audio synthesis wth Generative Adversarial Networks (GAN)☆37Updated 2 years ago
- Addressing the confounds of accompaniments in singer identification☆18Updated 5 years ago
- Learning Complex Basis Functions for Invariant Signal Representations with the Complex Autoencoder☆36Updated 4 months ago
- PyTorch Dataset for Speech and Music audio☆75Updated 10 months ago
- Accompanying code for our paper "Optimizing Short-Time Fourier Transform Parameters via Gradient Descent"☆33Updated 4 years ago
- ☆32Updated 4 years ago
- J-Net is aimed for audio separation with randomly weighted encoder.☆11Updated 5 years ago
- Paderbox: A collection of utilities for audio / speech processing☆38Updated last week
- A C++/Cython audio limiter for Python.☆25Updated 2 years ago
- A context encoder for audio inpainting☆25Updated 2 years ago
- Simple baseline model for the HEAR benchmark☆23Updated last month
- Reproducible Subjective Evaluation☆59Updated last year
- A PyTorch implementation of the paper: "AMSS-Net: Audio Manipulation on User-Specified Sources with Textual Queries" (ACM Multimedia 2021…☆21Updated 3 years ago
- Interference removal algorithm for multitrack live recordings☆11Updated 6 years ago
- Utilities for resampling and filtering audio data☆47Updated 5 years ago
- Translating Synthetic RIRs to Real RIRs☆41Updated last year
- Unsupervised Representation Learning for Singing Voice Separation☆22Updated 2 years ago
- ☆47Updated 6 years ago
- Code for paper submission under review.☆33Updated 7 years ago
- ☆25Updated 7 years ago
- Control mechanisms to the U-Net architecture for doing multiple source separation instruments☆51Updated 4 years ago
- Embedded segmental K-means (ES-KMeans) in Python.☆14Updated last year
- Permutation invariant training in PyTorch☆13Updated 4 years ago
- Zounds is a dataflow library for building directed acyclic graphs that transform audio. It uses the featureflow library to define the pro…☆24Updated 2 years ago
- audioLIME: Listenable Explanations Using Source Separation☆35Updated 3 years ago
- ☆16Updated 4 years ago
- SiSEC MUS 2018 Submission System☆43Updated 5 years ago
- Da - ECHO - RetrievAl - daTasEt☆26Updated 10 months ago
- Audio samples for the paper "TinyLSTMs: Efficient Neural Speech Enhancement for Hearing Aids"☆43Updated 4 years ago