google / df-conformer
Audio samples accompanying publications related to DF-Conformer, a speech enhancement model.
☆20Updated last year
Alternatives and similar repositories for df-conformer:
Users that are interested in df-conformer are comparing it to the libraries listed below
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆24Updated 2 years ago
- Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation☆11Updated 6 months ago
- ☆25Updated this week
- ☆11Updated 2 years ago
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆30Updated last year
- Multipurpose Multi Speaker Mixture Signal Generator☆44Updated 2 weeks ago
- Streaming Vocos☆20Updated last month
- Unofficial implementation of wavenext vocoder☆42Updated 5 months ago
- Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models☆42Updated 4 months ago
- ☆18Updated 9 months ago
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆11Updated 6 months ago
- Speech Parameter Estimation Using Differentiable Speech Synthesizer☆44Updated last year
- Source code for INTERSPEECH2020☆11Updated 4 years ago
- Official implementation of Self-Remixing☆13Updated last year
- Transformer with Local Modeling by Convolution for Speech Separation and Enhancement☆39Updated 6 months ago
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering☆18Updated last year
- NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling☆37Updated 3 years ago
- TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings☆24Updated 4 months ago
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation☆34Updated last year
- ☆22Updated 3 years ago
- with alignment learning and continuous wavelet transform☆20Updated 2 years ago
- Official repository for NAST: Noise Aware Speech Tokenization for Speech Language Models (Interspeech 2024) https://arxiv.org/abs/2406.11…☆44Updated 7 months ago
- Viterbi decoding in PyTorch☆27Updated 4 months ago
- Differentiable Mean Opinion Score Regularization for Perceptual Speech Enhancement☆22Updated last year
- Sequence alignement methods with helpers for PyTorch.☆24Updated 2 years ago
- (R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.☆47Updated last year
- ☆60Updated last year
- ☆24Updated last year
- This repo contains the official PyTorch implementation of "Analyzing Discrete Self Supervised Speech Representation For Spoken Language M…☆17Updated 2 years ago