danpovey / conditional-flow-matching
☆25Updated 9 months ago
Alternatives and similar repositories for conditional-flow-matching
Users who are interested in conditional-flow-matching are comparing it to the libraries listed below.
- Temporary anonymous version☆22Updated last year
- A spoken version of the textual story cloze benchmark☆17Updated last year
- ☆15Updated 4 years ago
- Pytorch implementation of LearnableUpsamplingLayer (NaturalSpeech, Tan et al., 2022)☆54Updated last year
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark☆27Updated last month
- Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models☆55Updated last month
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆10Updated 3 weeks ago
- Learning and controlling the source-filter representation of speech with a variational autoencoder☆45Updated 2 years ago
- Implementation of multi-level Contrastive Predictive Coding (CPC) methods☆19Updated 2 years ago
- ☆25Updated 7 months ago
- Multipurpose Multi Speaker Mixture Signal Generator☆44Updated 4 months ago
- Sequence alignment methods with helpers for PyTorch.☆24Updated 2 years ago
- ☆26Updated 4 years ago
- This repository contains the Kaldi LF-MMI implementation of the paper "Bayesian Learning of LF-MMI Trained Time Delay Neural Networks for…☆9Updated 3 years ago
- Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations"☆14Updated 2 years ago
- ☆64Updated 3 years ago
- ☆36Updated 2 years ago
- Speech Parameter Estimation Using Differentiable Speech Synthesizer☆43Updated 2 years ago
- Official implementation of DGP-based multi-speaker speech synthesis with PyTorch☆24Updated 4 years ago
- The open source code of ALMTokenizer2: Towards Low bit-rate and Semantic-rich Audio Tokenizer with Flow-based Scalar Diffusion Transforme…☆26Updated 2 weeks ago
- with alignment learning and continuous wavelet transform☆21Updated 2 years ago
- E2E TTS using Conditional Flow Matching (Experimental*)☆69Updated last year
- ☆25Updated 10 months ago
- ☆25Updated 3 years ago
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation☆34Updated last year
- Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale☆27Updated last year
- (WIP) Long-form speech generation☆31Updated 2 months ago
- ☆47Updated 2 months ago
- Torch implementation of Whisper-guided DDPM based Voice Conversion☆49Updated 2 years ago
- ☆62Updated last year