pseeth / otoworldLinks
Applying reinforcement learning to perform source separation.
☆23Updated 4 years ago
Alternatives and similar repositories for otoworld
Users that are interested in otoworld are comparing it to the libraries listed below
Sorting:
- Simple baseline model for the HEAR benchmark☆23Updated 2 weeks ago
- Accompanying code for our paper "Optimizing Short-Time Fourier Transform Parameters via Gradient Descent"☆34Updated 4 years ago
- PyTorch Dataset for Speech and Music audio☆78Updated last year
- Addressing the confounds of accompaniments in singer identification☆18Updated 5 years ago
- A PyTorch implementation of the paper: "AMSS-Net: Audio Manipulation on User-Specified Sources with Textual Queries" (ACM Multimedia 2021…☆21Updated 4 years ago
- ☆25Updated 7 years ago
- Unsupervised Representation Learning for Singing Voice Separation☆22Updated 2 years ago
- Evaluation kit for the HEAR Benchmark☆61Updated 2 weeks ago
- A python implementation of the Griffin Lim Algorithm for audio reconstruction from magnitudes☆33Updated last year
- Reproducible Subjective Evaluation☆61Updated last year
- Learning Complex Basis Functions for Invariant Signal Representations with the Complex Autoencoder☆38Updated 9 months ago
- ☆32Updated 4 years ago
- J-Net is aimed for audio separation with randomly weighted encoder.☆12Updated 5 years ago
- Interference removal algorithm for multitrack live recordings☆11Updated 6 years ago
- ☆16Updated 4 years ago
- COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations☆48Updated last year
- Utilities for resampling and filtering audio data☆47Updated 5 years ago
- A context encoder for audio inpainting☆25Updated 2 years ago
- Paderbox: A collection of utilities for audio / speech processing☆41Updated 2 months ago
- spectrogram inversion tools in PyTorch. Documentation: https://spectrogram-inversion.readthedocs.io☆51Updated 4 months ago
- PyTorch implementation of the NSGT/sliCQT☆17Updated last year
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"☆33Updated 3 years ago
- MicRank is a Learning to Rank neural channel selection framework where a DNN is trained to rank microphone channels.☆22Updated 4 years ago
- Epoch-synchronous overlap-add (ESOLA) for time-and pitch-scale modification of speech signals.☆19Updated 5 years ago
- Baseline systems for the FSD50K dataset☆70Updated 4 years ago
- ☆41Updated 5 years ago
- Attacking Speaker Recognition with Deep Generative Models☆34Updated 2 years ago
- Accompanying code for our paper "Point Cloud Audio Processing"☆19Updated 4 years ago
- Permutation invariant training in PyTorch☆13Updated 5 years ago
- This repo contains code for comparing audio representation sin the task of audio synthesis wth Generative Adversarial Networks (GAN)☆37Updated 2 years ago