pseeth / otoworld
Applying reinforcement learning to perform source separation.
☆23Updated 4 years ago
Alternatives and similar repositories for otoworld
Users that are interested in otoworld are comparing it to the libraries listed below
- PyTorch Dataset for Speech and Music audio☆78Updated last year
- Addressing the confounds of accompaniments in singer identification☆18Updated 5 years ago
- Accompanying code for our paper "Optimizing Short-Time Fourier Transform Parameters via Gradient Descent"☆34Updated 4 years ago
- Reproducible Subjective Evaluation☆60Updated last year
- ☆32Updated 4 years ago
- ☆25Updated 7 years ago
- A PyTorch implementation of the paper: "AMSS-Net: Audio Manipulation on User-Specified Sources with Textual Queries" (ACM Multimedia 2021…☆21Updated 4 years ago
- J-Net is aimed at audio separation with a randomly weighted encoder.☆12Updated 5 years ago
- Unsupervised Representation Learning for Singing Voice Separation☆22Updated 2 years ago
- Simple baseline model for the HEAR benchmark☆23Updated 3 weeks ago
- This repo contains code for comparing audio representations in the task of audio synthesis with Generative Adversarial Networks (GAN)☆37Updated 2 years ago
- Permutation invariant training in PyTorch☆13Updated 4 years ago
- Interference removal algorithm for multitrack live recordings☆11Updated 6 years ago
- ☆16Updated 4 years ago
- Learning Complex Basis Functions for Invariant Signal Representations with the Complex Autoencoder☆38Updated 9 months ago
- Asteroid's filterbanks☆86Updated 8 months ago
- A python implementation of the Griffin Lim Algorithm for audio reconstruction from magnitudes☆33Updated last year
- Attacking Speaker Recognition with Deep Generative Models☆34Updated 2 years ago
- Utilities for resampling and filtering audio data☆47Updated 5 years ago
- COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations☆48Updated last year
- Epoch-synchronous overlap-add (ESOLA) for time- and pitch-scale modification of speech signals.☆19Updated 5 years ago
- MicRank is a Learning to Rank neural channel selection framework where a DNN is trained to rank microphone channels.☆22Updated 4 years ago
- Pytorch implementation of "A Differentiable Perceptual Audio Metric Learned from Just Noticeable Differences", Pranay Manocha et al. - un…☆63Updated 5 years ago
- Accompanying code for our paper "Point Cloud Audio Processing"☆19Updated 4 years ago
- Upsampling Artifacts in Neural Audio Synthesis – https://arxiv.org/abs/2010.14356☆81Updated 4 years ago
- Latte: Cross-framework Python Package for Evaluation of Latent-based Generative Models☆36Updated last month
- Frontend filterbank learning module with HVQT initialization capabilities.☆21Updated last year
- Speech enhancement using mimic loss☆16Updated 5 years ago
- Paderbox: A collection of utilities for audio / speech processing☆41Updated 2 months ago
- Repository for paper "Non-intrusive speech intelligibility prediction from discrete latent representations"☆12Updated 3 years ago