DCASE2023-Task7-Foley-Sound-Synthesis / dcase2023_task7_baseline
☆33Updated last year
Related projects ⓘ
Alternatives and complementary repositories for dcase2023_task7_baseline
- Translating Synthetic RIRs to Real RIRs☆40Updated last year
- Query-conditioned target sound extraction model☆17Updated 2 weeks ago
- Bandwidth Extension of Historical Recordings using Generative Adversarial Networks☆34Updated last year
- TAPE: An End-to-End Timbre-Aware Pitch Estimator☆20Updated 11 months ago
- Pytorch implementation of subband decomposition☆89Updated 2 years ago
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆57Updated last year
- Prediction of sound event bounding boxes (SEBBs)☆22Updated 3 months ago
- ☆27Updated 2 weeks ago
- Implementation of FiNS model for RIR estimation☆25Updated last year
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆49Updated 2 weeks ago
- experiments about AudioSet☆43Updated last year
- Augmenting Room Impulse Response☆37Updated last year
- Unsupervised Representation Learning for Singing Voice Separation☆21Updated last year
- Official implementation of "Learning Music Audio Representations Via Weak Language Supervision" (ICASSP 2022)☆46Updated 2 years ago
- Unofficial implementation of NANSY++ in Pytorch Lightning☆49Updated 8 months ago
- ☆43Updated 2 weeks ago
- Implementation for "Music Enhancement via Image Translation and Vocoding"☆52Updated 2 years ago
- Implementation of the paper, T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis, ac…☆26Updated 5 months ago
- Learning differentiable temporal resolution on time-series data.☆33Updated 2 years ago
- Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆34Updated 11 months ago
- ☆21Updated 7 months ago
- Please visit https://thuhcsi.github.io/SnakeGAN/☆36Updated last year
- music denoising network☆11Updated last month
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.☆72Updated 2 months ago
- An invertible and differentiable implementation of the Constant-Q Transform (CQT).☆54Updated last year
- A Diffusion Probabilistic Model for Target Sound Extraction☆35Updated last month
- PAM is a no-reference audio quality metric for audio generation tasks☆49Updated 4 months ago
- ☆49Updated last year
- This code is to run the WARP-Q speech quality metric.☆34Updated last month
- ☆19Updated last year