google / df-conformer
Audio samples accompanying publications related to DF-Conformer, a speech enhancement model.
☆20Updated last year
Related projects: ⓘ
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆24Updated 2 years ago
- ☆24Updated last year
- Sequence alignement methods with helpers for PyTorch.☆24Updated last year
- Viterbi decoding in PyTorch☆23Updated 3 weeks ago
- Unofficial implementation of wavenext vocoder☆28Updated 3 weeks ago
- TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings☆15Updated last month
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆41Updated last week
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆21Updated 5 months ago
- ☆11Updated last year
- ☆13Updated last year
- Adaptive Vocoder for Custom Voice☆58Updated last year
- Prosodic Speech Segmentation with Transformers☆22Updated 6 months ago
- Transformer with Local Modeling by Convolution for Speech Separation and Enhancement☆26Updated last month
- Source code for INTERSPEECH2020☆11Updated 4 years ago
- Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs☆33Updated last week
- Torch implementation of Whisper-guided DDPM based Voice Conversion☆49Updated last year
- FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)☆21Updated 6 months ago
- Code and data recipes for the paper: Heterogeneous Target Speech Separation☆38Updated last year
- A CSRankings-like index for speech researchers☆30Updated last year
- Multipurpose Multi Speaker Mixture Signal Generator☆43Updated 6 months ago
- ☆21Updated last year
- Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"☆12Updated 2 years ago
- Deep Speech Distances PyTorch☆27Updated 2 years ago
- CDER (Conversational Diarization Error Rate) Scoring Tool☆15Updated 2 years ago
- ☆16Updated 8 months ago
- ☆19Updated 2 years ago
- Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clusterin…☆41Updated last year
- Differentiable Mean Opinion Score Regularization for Perceptual Speech Enhancement☆22Updated last year
- ConMamba for Automatic Speech Recognition☆38Updated last month
- ☆18Updated 3 months ago