TomohikoNakamura / asteroid_jaCappella
☆12Updated last year
Related projects ⓘ
Alternatives and complementary repositories for asteroid_jaCappella
- PyTorch implementation of WaveFit [2022, Google] which is one of SOTA lightweight/fast speech vocoders.☆47Updated last month
- DDPM-based Pitch Generation and Pitch Controllable Voice Synthesis.☆51Updated last year
- ☆11Updated last year
- Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction☆33Updated this week
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆22Updated 7 months ago
- This is a repository of neural full-rank spatial covariance analysis with speaker activity (neural FCASA).☆24Updated 5 months ago
- An invertible and differentiable implementation of the Constant-Q Transform (CQT).☆54Updated last year
- Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986☆37Updated last month
- A repository of Japanese Phoneme-Level BERT☆20Updated 11 months ago
- Speech Human Evaluation Estimation Toolkit (SHEET)☆39Updated last week
- A Python package of the dynamic compressive gammachirp filterbank (dcGC-FB)☆27Updated 6 months ago
- ☆51Updated this week
- ☆19Updated last year
- RWCP-SSD-Onomatopoeia☆21Updated last year
- logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even whe…☆33Updated 2 months ago
- Official Pytorch implementation of PULSE: Positive–Unlabelled Learning for audio Signal Enhancement (Best Paper Award at ICASSP 2023)☆39Updated last year
- The code used for TASLP 2022. The latest version is available in SoundSourceSeparation repository.☆8Updated 2 years ago
- logWMSE, an audio quality metric & loss function with support for digital silence target. Useful for training and evaluating audio source…☆28Updated 3 months ago
- We implemented the DEMUCS model for speech enhancement in the time-frequency domain, and additionally implemented HD-DEMUCS.☆22Updated last year
- SelfRemaster: SSL Speech Restoration☆85Updated 10 months ago
- Coco-Nut (Corpus of connecting NIHONGO utterance and text) corpus☆21Updated 5 months ago
- Speech Parameter Estimation Using Differentiable Speech Synthesizer☆44Updated last year
- Source code of APNet2, a vocoder☆51Updated 11 months ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆68Updated last year
- ☆40Updated 5 months ago
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.☆72Updated 2 months ago
- Inference code for PaSST, using the HEAR API.☆29Updated 10 months ago
- Frechet Audio Distance evaluation in PyTorch☆35Updated last year
- Audio production style transfer with inference-time optimization☆26Updated this week
- TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings☆17Updated last month