yuyq96 / kaldifeat
A light-weight Python library for computing Kaldi-style acoustic features based on NumPy
☆14Updated 4 years ago
Related projects: ⓘ
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many VoiceConversion with Adversarial Speaker Recognition" by …☆39Updated last year
- TTS Text Analyzer☆31Updated last year
- Reproduction of paper: Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorizatio…☆17Updated 5 years ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆37Updated 3 years ago
- LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation☆79Updated 3 years ago
- Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files☆47Updated 2 months ago
- Production first, nn-based on-device signal processing toolkit.☆63Updated last year
- Multipurpose Multi Speaker Mixture Signal Generator☆43Updated 6 months ago
- ICASSP 2021 accepted papers in term of voice conversion (VC)☆18Updated 3 years ago
- Yin pitch estimator in PyTorch☆113Updated last year
- Convert WSJ sphere format to waveform and do data simulation.☆16Updated 4 years ago
- Materials accompanying the paper "Phonological features for 0-shot multilingual speech synthesis"☆31Updated 4 years ago
- A Pytorch version of LPCNet, including dump weight☆30Updated 2 years ago
- Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm.☆67Updated 3 years ago
- Audio LPC (linear prediction code) using mel spectorgram, compatible for LPCNet☆61Updated 3 years ago
- High-Fidelity Neural Phonetic Posteriorgrams☆74Updated 3 weeks ago
- Convert kaldi feature extraction and nnet3 models into Tensorflow Lite models. Currently aimed at converting kaldi's x-vector models and …☆20Updated last year
- End-to-end diarization loss☆19Updated 3 years ago
- Speech samples and code of BEdit-TTS☆32Updated 11 months ago
- Pytorch implementation of subband decomposition☆88Updated 2 years ago
- ☆54Updated 3 years ago
- PyTorch Implementation of Generalized End-to-End Loss for Speaker Verification☆28Updated 3 years ago
- Pytorch implementation of LearnableUpsamplingLayer (NaturalSpeech, Tan et al., 2022)☆54Updated 6 months ago
- multilingual speech aligner☆70Updated 10 months ago
- ☆13Updated 2 years ago
- Clustering-based methods for overlapping diarization☆68Updated 8 months ago
- Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/☆32Updated last year
- Pytorch implementation of "f0-consistent many-to-many non-parallel voice conversion via conditional autoencoder"☆28Updated 3 years ago
- Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"☆35Updated 4 years ago
- Voice conversion model for real-time speech synthesis using PPG (Phonetic PosteriorGram) as an intermediate feature, written in Pytorch.☆26Updated 2 years ago