Barbany / Multi-speaker-Neural-Vocoder
Bachelor's thesis carried out at Universitat Politecnica de Catalunya in partial fulfillment of the requirements for the degree in Telecommunications Technologies and Services Engineering
☆15Updated last month
Related projects:
- ☆26Updated 3 years ago
- ☆18Updated 4 years ago
- This repo contains code for comparing audio representations in the task of audio synthesis with Generative Adversarial Networks (GAN)☆37Updated last year
- PyTorch implementation for Deep Griffin-Lim Iteration paper (https://arxiv.org/abs/1903.03971)☆36Updated 4 years ago
- A pytorch implementation of FFTNet.☆36Updated 6 years ago
- spectrogram inversion tools in PyTorch. Documentation: https://spectrogram-inversion.readthedocs.io☆46Updated last year
- A python implementation of the Griffin Lim Algorithm for audio reconstruction from magnitudes☆32Updated 8 months ago
- Code for paper submission under review.☆33Updated 6 years ago
- RawNet: Fast End-to-End Neural Vocoder☆42Updated 5 years ago
- Single Pass Spectrogram Inversion in a Jupyter Python notebook☆33Updated 7 years ago
- Conditioned U-Net for Music Source Separation☆20Updated 3 years ago
- PyTorch implementation of NVIDIA WaveGlow with constant memory cost.☆34Updated last year
- Unsupervised Representation Learning for Singing Voice Separation☆21Updated last year
- A context encoder for audio inpainting☆25Updated last year
- ☆29Updated 4 years ago
- 2018/2019 TTS framework integrating state of the art open source methods☆47Updated 5 years ago
- Interspeech 2019 tutorial materials☆48Updated 4 years ago
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation☆38Updated 4 years ago
- Accompanying code for our paper "Optimizing Short-Time Fourier Transform Parameters via Gradient Descent"☆31Updated 3 years ago
- ☆18Updated 5 years ago
- ☆19Updated 6 years ago
- Code for our paper "VaPar Synth - A Variational Parametric Model for Audio Synthesis"☆32Updated 4 years ago
- Demo page of our paper Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks With Guided Attention, ICASSP 201…☆15Updated 3 years ago
- This repository contains laughter-related synthesis systems.☆13Updated 3 years ago
- ☆63Updated last year
- A PyTorch implementation of the paper: "AMSS-Net: Audio Manipulation on User-Specified Sources with Textual Queries" (ACM Multimedia 2021…☆20Updated 3 years ago
- Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separation☆14Updated 3 years ago
- Based on https://github.com/fatchord/WaveRNN☆24Updated 4 years ago
- ☆23Updated this week