sony / ai-research-codeLinks
☆355Updated last year
Alternatives and similar repositories for ai-research-code
Users that are interested in ai-research-code are comparing it to the libraries listed below
Sorting:
- Perceptual Metrics of Audio - perceptually relevant loss function. DPAM and CDPAM☆365Updated 2 years ago
- A library for speech data augmentation in time-domain☆668Updated 3 years ago
- Official PyTorch Implementation of CleanUNet (ICASSP 2022)☆323Updated last year
- A PyTorch implementation of DNN-based source separation.☆302Updated 3 years ago
- A fast, high-quality neural vocoder.☆288Updated 2 years ago
- Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-R…☆329Updated 2 years ago
- Audio transformations library for PyTorch☆233Updated 3 years ago
- ☆497Updated last year
- see README☆353Updated this week
- NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling @ INTERSPEECH 2021☆284Updated 3 years ago
- The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".☆265Updated last year
- Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch.☆403Updated 4 years ago
- The code for multi-channel source separation and dereverberation such as FastMNMF1, FastMNMF2, and AR-FastMNMF2.☆203Updated 2 years ago
- A PyTorch implementation of Meta-TasNet from "Meta-learning Extractors for Music Source Separation☆139Updated last year
- Fast PyTorch based DSP for audio and 1D signals☆445Updated 5 months ago
- An STFT/iSTFT for PyTorch.☆364Updated last year
- An open source dataset for source separation☆437Updated last year
- VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network☆320Updated last year
- VQ-VAE for Acoustic Unit Discovery and Voice Conversion☆337Updated 2 years ago
- [InterSpeech 2020] "AutoSpeech: Neural Architecture Search for Speaker Recognition" by Shaojin Ding*, Tianlong Chen*, Xinyu Gong, Weiwei …☆208Updated 2 years ago
- Official repository for RawNet, RawNet2, and RawNet3☆384Updated last year
- iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform☆259Updated last week
- BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation☆215Updated 2 years ago
- This repository is an implementation of this article: https://arxiv.org/pdf/2107.03312.pdf☆398Updated 3 years ago
- Unofficial implementation of HiFi-GAN+ from the paper "Bandwidth Extension is All You Need" by Su, et al.☆214Updated last year
- PyTorch reimplementation of per-channel energy normalization for audio.☆100Updated 6 years ago
- HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks☆215Updated 4 years ago
- Improved Wave-U-Net implemented in Pytorch☆350Updated last year
- LEAF is a learnable alternative to audio features such as mel-filterbanks, that can be initialized as an approximation of mel-filterbanks…☆511Updated 3 years ago
- Efficient Training of Audio Transformers with Patchout☆343Updated last year