teo-sl / Audio-Super-Resolution-ViT
This repository contains the source code for the implementation of two deep learning models concerning the audio super resolution task.
☆14Updated last year
Alternatives and similar repositories for Audio-Super-Resolution-ViT:
Users that are interested in Audio-Super-Resolution-ViT are comparing it to the libraries listed below
- iSeparate library for the SDX2023 challenge☆13Updated last year
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆69Updated 2 years ago
- Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction☆37Updated 2 months ago
- Official implementation of A cappella: Audio-visual Singing VoiceSeparation, from BMVC21☆15Updated 2 years ago
- Implementation of Emo-StarGAN☆46Updated last year
- Adaptive Vocoder for Custom Voice☆59Updated 2 years ago
- Repo for source code of EBEN: Extreme Bandwidth Extension Network☆72Updated 2 weeks ago
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆30Updated last year
- ☆79Updated last year
- Contains the code associated with the ICLR submission for our text-to-speech diffusion model☆51Updated last year
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆62Updated last year
- ☆65Updated last week
- A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis☆44Updated last year
- AudioSR-Upsampling (any -> 48kHz)☆38Updated 11 months ago
- [InterSpeech'2024] FluentEditor:Text-based Speech Editing by Considering Acoustic and Prosody Consistency☆49Updated 3 months ago
- ☆48Updated last year
- An open-source Kazakh Emotional Text-to-Speech Dataset☆27Updated 9 months ago
- Official implementation of DualCycleGAN for nonparallel audio super resolution☆51Updated 2 years ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆36Updated last year
- Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆35Updated last year
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion☆34Updated 7 months ago
- Unofficial implementation of wavenext vocoder☆40Updated 5 months ago
- An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation☆83Updated 10 months ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆64Updated last year
- ☆23Updated 3 years ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆48Updated last year
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆44Updated 4 months ago
- ☆37Updated 7 months ago
- ☆43Updated 7 months ago
- HiFi++: a Unified Framework for Bandwidth Extension and Speech Enhancement (ICASSP 2023)☆77Updated last year