TomohikoNakamura / asteroid_jaCappella
☆14 Updated 2 years ago
Alternatives and similar repositories for asteroid_jaCappella
Users who are interested in asteroid_jaCappella are comparing it to the repositories listed below.
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation" ☆26 Updated last year
- PyTorch implementation of "Source Separation by Flow Matching (FLOSS)" by Google DeepMind ☆72 Updated last month
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec ☆37 Updated 3 months ago
- ☆28 Updated last year
- Speech Parameter Estimation Using Differentiable Speech Synthesizer ☆44 Updated 2 years ago
- Official implementation of Self-Remixing ☆15 Updated last year
- PyTorch implementation of WaveFit [2022, Google], one of the SOTA lightweight/fast speech vocoders. ☆59 Updated last week
- Implementation of vocoders powered by PyTorch Lightning ☆17 Updated last year
- An implementation of "Towards Improving Harmonic Sensitivity and Prediction Stability for Singing Melody Extraction", in ISMIR 2023 ☆23 Updated last year
- Frechet Audio Distance evaluation in PyTorch ☆36 Updated 2 years ago
- Reproducible Subjective Evaluation ☆60 Updated last year
- An invertible and differentiable implementation of the Constant-Q Transform (CQT). ☆62 Updated 2 years ago
- Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction ☆43 Updated 9 months ago
- Prosody and Pronunciation Modification Network ☆56 Updated 4 months ago
- Unofficial implementation of NANSY++ in PyTorch Lightning ☆50 Updated last year
- TAPE: An End-to-End Timbre-Aware Pitch Estimator ☆22 Updated last year
- ☆57 Updated 8 months ago
- ☆44 Updated last year
- ☆12 Updated last year
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv… ☆38 Updated last year
- ☆58 Updated last year
- Prediction of sound event bounding boxes (SEBBs) ☆29 Updated last year
- PodcastMix: A dataset for separating music and speech in podcasts. ☆44 Updated last year
- Speech Human Evaluation Estimation Toolkit (SHEET) ☆104 Updated this week
- ☆17 Updated last year
- logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even whe… ☆35 Updated 2 months ago
- JATTS: A modern, research-oriented Japanese Text-to-speech Open-sourced Toolkit ☆39 Updated 3 months ago
- ☆17 Updated 11 months ago
- This repository contains the code for the paper "voc2vec: A Foundation Model for Non-Verbal Vocalization", accepted at ICASSP 2025. ☆41 Updated 5 months ago
- Code for running the WARP-Q speech quality metric. ☆35 Updated 10 months ago