LvHang / pitch
a standalone pitch extractor
☆13Updated 7 years ago
Alternatives and similar repositories for pitch:
Users that are interested in pitch are comparing it to the libraries listed below
- ☆76Updated 3 years ago
- streaming attention networks for end-to-end automatic speech recognition☆55Updated 4 years ago
- Audio LPC (linear prediction code) using mel spectorgram, compatible for LPCNet☆61Updated 3 years ago
- pytorch CTC implementation for ASR. Use eesen's fst decoder framework☆10Updated 5 years ago
- ☆51Updated 6 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Updated 2 years ago
- Pytorch implementation of "Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion" [Intersp…☆28Updated 5 years ago
- The repo contains our code of ``Semantic Mask for Transformer based End-to-End Speech Recognition"☆38Updated 4 years ago
- This repository is an extension of GAN based speech enhancement called SEGAN, and we present two modifications to make model training mor…☆37Updated 2 years ago
- Reproduction of paper: Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorizatio…☆17Updated 5 years ago
- Pitch estimation network (PiENet) for noise-robust neural F0 estimation of speech signals☆50Updated 5 years ago
- ESPnet extensions for semi-supervised end-to-end speech recognition. See also https://github.com/ShigekiKarita/espnet-semi-supervised/tre…☆38Updated 5 years ago
- ☆52Updated 4 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆60Updated 2 years ago
- ☆48Updated 4 years ago
- HMM, CTC, RNN-Transducer, forward-backward algorithm☆21Updated last year
- VoxSRC Challenge☆31Updated 5 years ago
- Speech enhancement system for the CHiME-5 dinner party scenario☆109Updated 2 months ago
- Recurrent Neural Aligner☆50Updated 5 years ago
- Memory efficient transducer loss computation☆68Updated 2 years ago
- Pulse Model vocoder☆42Updated 6 years ago
- unsupervised ASR (mainly phone classifier) using EODM and GAN☆12Updated 4 years ago
- PyTorch implementation for Deep Griffin-Lim Iteration paper(https://arxiv.org/abs/1903.03971)☆38Updated 5 years ago
- ☆41Updated 6 years ago
- Code for reproducing experiments in "Domain-Adversarial Voice Activity Detection"☆23Updated 5 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Updated 5 years ago
- Gaussian Mixture VAE Tacotron☆53Updated last year
- A KALDI/C++ implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆14Updated 5 years ago
- scripts to align a given wave to its transcription using trained models by Kaldi☆32Updated 5 years ago
- ☆34Updated 5 years ago