teo-sl / Audio-Super-Resolution-ViT
This repository contains the source code for two deep learning models for the audio super-resolution task.
☆14Updated 2 years ago
Alternatives and similar repositories for Audio-Super-Resolution-ViT:
Users who are interested in Audio-Super-Resolution-ViT are comparing it to the repositories listed below.
- Repo for source code of EBEN: Extreme Bandwidth Extension Network☆73Updated 3 months ago
- iSeparate library for the SDX2023 challenge☆13Updated last year
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆64Updated last year
- Contains the code associated with the ICLR submission for our text-to-speech diffusion model☆53Updated last year
- ☆43Updated 10 months ago
- A standardized toolkit of Kernel Audio Distance (KAD)—a distribution-free, unbiased, and computationally efficient metric for evaluating …☆66Updated last month
- Official implementation of DualCycleGAN for nonparallel audio super resolution☆53Updated 2 years ago
- ☆83Updated last year
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆24Updated 2 years ago
- Unsupervised Music Source Separation Using Differentiable Parametric Source Models☆63Updated 2 years ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆63Updated last year
- VITS-based zero-shot TTS system varying with diverse style/speaker conditioning methods.☆36Updated 2 years ago
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆49Updated last month
- Adaptive Vocoder for Custom Voice☆59Updated 2 years ago
- [ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion☆62Updated last week
- ☆65Updated last year
- AudioSR-Upsampling (any -> 48kHz)☆40Updated last year
- Implementation of Emo-StarGAN☆45Updated last year
- Source code and demo for INTERSPEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆36Updated last year
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆70Updated 2 years ago
- Streaming Audiotransformers for online Audio tagging☆44Updated 10 months ago
- Findings of ACL 2023 | AlignSTS: a speech-to-singing (STS) model based on modality disentanglement and cross-modal alignment☆68Updated 9 months ago
- Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction☆37Updated 5 months ago
- HiFi++: a Unified Framework for Bandwidth Extension and Speech Enhancement (ICASSP 2023)☆81Updated last year
- Audio Super-Resolution using Deep Learning☆8Updated 2 years ago
- An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation☆87Updated last year
- Training code and trained checkpoints for ASGAN.☆62Updated last year
- ☆30Updated 2 years ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆36Updated last year
- This is the implementation of our Interspeech 2022 paper "Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…☆18Updated last year