LvHang / pitch
a standalone pitch extractor
☆13Updated 6 years ago
Related projects:
- PyTorch CTC implementation for ASR, using Eesen's FST decoder framework☆10Updated 4 years ago
- Audio LPC (linear predictive coding) using mel spectrogram, compatible with LPCNet☆61Updated 3 years ago
- Python wrapper for OpenFST and its extensions from Kaldi. Also supports reading/writing ark/scp files☆47Updated 2 months ago
- The repo contains our code for "Semantic Mask for Transformer based End-to-End Speech Recognition"☆37Updated 4 years ago
- HMM, CTC, RNN-Transducer, forward-backward algorithm☆19Updated last year
- streaming attention networks for end-to-end automatic speech recognition☆55Updated 4 years ago
- PyTorch implementation of the Deep Griffin-Lim Iteration paper (https://arxiv.org/abs/1903.03971)☆36Updated 4 years ago
- unsupervised ASR (mainly phone classifier) using EODM and GAN☆12Updated 3 years ago
- Recurrent Neural Aligner☆49Updated 4 years ago
- PyTorch implementation of "Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion" [Intersp…☆28Updated 5 years ago
- Code for reproducing experiments in "Domain-Adversarial Voice Activity Detection"☆23Updated 4 years ago
- A Kaldi/C++ implementation of Google Brain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆14Updated 5 years ago
- Memory efficient transducer loss computation☆68Updated 2 years ago
- Scripts to align a given waveform to its transcription using models trained with Kaldi☆32Updated 5 years ago
- Convert WSJ sphere format to waveform and do data simulation.☆16Updated 4 years ago
- Token-Level Ensemble Distillation for Grapheme-to-Phoneme Conversion☆20Updated 5 years ago
- A PyTorch implementation of FB-MelGAN☆84Updated 4 years ago
- Code for synchronising all CHiME-5 audio signals for use in CHiME-6☆15Updated 4 years ago
- Pitch estimation network (PiENet) for noise-robust neural F0 estimation of speech signals☆50Updated 5 years ago
- Efficient Neural Architecture Search via Straight-Through Gradients☆13Updated 3 years ago
- ICASSP 2021 accepted papers on voice conversion (VC)☆18Updated 3 years ago