teo-sl / Audio-Super-Resolution-ViT
This repository contains the source code for the implementation of two deep learning models concerning the audio super resolution task.
☆12Updated last year
Related projects ⓘ
Alternatives and complementary repositories for Audio-Super-Resolution-ViT
- Adaptive Vocoder for Custom Voice☆58Updated 2 years ago
- ☆79Updated last year
- Repo for source code of EBEN: Extreme Bandwidth Extension Network☆69Updated last month
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆30Updated last year
- Official implementation of DualCycleGAN for nonparallel audio super resolution☆50Updated 2 years ago
- iSeparate library for the SDX2023 challenge☆13Updated 10 months ago
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆42Updated last month
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆24Updated 2 years ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆68Updated last year
- Test code disclosure for the research paper "UnDiff: Unsupervised Voice Restoration with Unconditional Diffusion Model", as a supplementa…☆18Updated last year
- Simple PyTorch Denoisers for Waveform Audio☆32Updated 3 weeks ago
- Ultrafast GAN based Vocoder for Text to Speech☆50Updated 2 years ago
- Viterbi decoding in PyTorch☆26Updated last month
- Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆34Updated 11 months ago
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆57Updated last year
- ☆48Updated last year
- Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction☆28Updated this week
- ☆20Updated 10 months ago
- AudioSR-Upsampling (any -> 48kHz)☆38Updated 8 months ago
- ☆61Updated 7 months ago
- Stable Audio UnOffical Implementation: Latent Diffusion for Audio Generation☆23Updated 8 months ago
- Official repository of the paper "Solving Audio Inverse Problems with a Diffusion Model", submitted to ICASSP 23☆109Updated last year
- An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation☆78Updated 7 months ago
- SRTNet☆24Updated last year
- NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling☆37Updated 3 years ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆64Updated last year
- A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis☆43Updated last year
- Pytorch implementation of LearnableUpsamplingLayer (NaturalSpeech, Tan et al., 2022)☆54Updated 7 months ago
- UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation☆70Updated 3 years ago
- Contains the code associated with the ICLR submission for our text-to-speech diffusion model☆51Updated last year