sony / ai-research-codeLinks
☆356Updated last year
Alternatives and similar repositories for ai-research-code
Users that are interested in ai-research-code are comparing it to the libraries listed below
Sorting:
- Perceptual Metrics of Audio - perceptually relevant loss function. DPAM and CDPAM☆368Updated 2 years ago
- A library for speech data augmentation in time-domain☆669Updated 3 years ago
- A PyTorch implementation of DNN-based source separation.☆304Updated 3 years ago
- A fast, high-quality neural vocoder.☆290Updated 2 years ago
- Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch.☆403Updated 4 years ago
- see README☆356Updated last month
- Official PyTorch Implementation of CleanUNet (ICASSP 2022)☆327Updated last year
- Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-R…☆331Updated 2 years ago
- VQ-VAE for Acoustic Unit Discovery and Voice Conversion☆338Updated 2 years ago
- Audio transformations library for PyTorch☆233Updated 3 years ago
- VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network☆319Updated last year
- NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling @ INTERSPEECH 2021☆284Updated 3 years ago
- ☆499Updated last year
- Fast PyTorch based DSP for audio and 1D signals☆446Updated 6 months ago
- An STFT/iSTFT for PyTorch.☆365Updated last year
- Improved Wave-U-Net implemented in Pytorch☆354Updated last year
- The code for multi-channel source separation and dereverberation such as FastMNMF1, FastMNMF2, and AR-FastMNMF2.☆203Updated 2 years ago
- Any-to-any voice conversion by end-to-end extracting and fusing fine-grained voice fragments with attention☆202Updated 4 years ago
- HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks☆217Updated 4 years ago
- An open source dataset for source separation☆440Updated last year
- Official implementation of the source-filter HiFiGAN vocoder☆259Updated 2 years ago
- BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation☆217Updated 2 years ago
- Official repository for RawNet, RawNet2, and RawNet3☆384Updated last year
- The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".☆266Updated last month
- Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!☆353Updated 3 years ago
- A PyTorch implementation of Meta-TasNet from "Meta-learning Extractors for Music Source Separation☆140Updated last year
- General Speech Restoration☆283Updated last year
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates @ INTERSPEECH 2022☆295Updated last year
- [InterSpeech 2020] "AutoSpeech: Neural Architecture Search for Speaker Recognition" by Shaojin Ding*, Tianlong Chen*, Xinyu Gong, Weiwei …☆208Updated 2 years ago
- museval - source separation evaluation tools for python☆221Updated 3 months ago