TomohikoNakamura / asteroid_jaCappellaLinks
☆13Updated last year
Alternatives and similar repositories for asteroid_jaCappella
Users that are interested in asteroid_jaCappella are comparing it to the libraries listed below
Sorting:
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec☆36Updated last month
- ☆57Updated 6 months ago
- An invertible and differentiable implementation of the Constant-Q Transform (CQT).☆60Updated 2 years ago
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆27Updated last year
- Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction☆39Updated 7 months ago
- A repository of Japanese Phoneme-Level BERT☆22Updated last year
- ☆27Updated last year
- This repository contains the code for the paper "voc2vec: A Foundation Model for Non-Verbal Vocalization", accepted at ICASSP 2025.☆37Updated 2 months ago
- Chorale Music Separation Dataset and Model Framework☆37Updated 2 years ago
- Official Repository for "Training-Free Multi-Step Audio Source Separation"☆46Updated last month
- Implementation of "Self-Supervised Contrastive Learning for Singing Voices"☆19Updated 3 years ago
- PyTorch implementation of WaveFit [2022, Google] which is one of SOTA lightweight/fast speech vocoders.☆56Updated 2 months ago
- Implementation of vocoders empowered with pytorch lightning☆17Updated last year
- Frechet Audio Distance evaluation in PyTorch☆36Updated 2 years ago
- Unofficial implementation of NANSY++ in Pytorch Lightning☆50Updated last year
- Reproducible Subjective Evaluation☆60Updated last year
- Multi-lingual AudioCaps☆11Updated last year
- An implementation of "Towards Improving Harmonic Sensitivity and Prediction Stability for Singing Melody Extraction", in ISMIR 2023☆23Updated last year
- ☆17Updated 11 months ago
- ☆10Updated last year
- US-based professors who work on audio. For students who would like to apply for RA, PhD, postdoc in audio research.☆26Updated 3 months ago
- ☆55Updated last year
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆69Updated 2 years ago
- A pretrained model for "A Phoneme-informed Neural Network Model for Note-level Singing Transcription", ICASSP 2023☆36Updated last year
- JATTS: A modern, research-oriented Japanese Text-to-speech Open-sourced Toolkit☆36Updated last month
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…☆76Updated last year
- ☆63Updated last year
- ☆9Updated 3 years ago
- TAPE: An End-to-End Timbre-Aware Pitch Estimator☆22Updated last year
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆37Updated last year