jaekookang / p2fa_py3
Penn Phonetics Lab Forced Aligner Toolkit (P2FA) for Python3
☆96Updated 6 months ago
Related projects:
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text☆229Updated 4 years ago
- A script for audio/transcript alignment. Fork of p2fa.☆68Updated 6 years ago
- Autoregressive Predictive Coding: An unsupervised autoregressive model for speech representation learning☆184Updated 4 years ago
- ☆180Updated 4 months ago
- Charsiu: A neural phonetic aligner.☆267Updated 2 years ago
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)☆78Updated 2 years ago
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…☆84Updated 3 years ago
- Processing and extraction of face and mouth image files from the TCDTIMIT database☆44Updated 3 years ago
- This is the GitHub page for publicly available emotional speech data.☆314Updated 2 years ago
- Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation (INTERSPEECH 2020)☆135Updated 2 years ago
- Mirror of the VoxCeleb dataset, a large-scale speaker identification dataset☆67Updated 5 years ago
- Phoneme Recognition using RecNet☆90Updated 7 years ago
- Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"☆166Updated last year
- A Python module for interacting with Praat TextGrid files. Also includes a class for reading HTK .mlf files into Praat☆281Updated 10 months ago
- Code to train and run Blow☆143Updated 5 years ago
- Reproducing Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis (https://arxiv.org/pdf/1803.09…☆61Updated 6 years ago
- Python interface for forced audio alignment using HTK and SoX☆331Updated 4 years ago
- Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.☆182Updated 4 years ago
- A Python toolbox for speech features extraction☆158Updated last year
- DeepSpeech based forced alignment tool☆232Updated 3 years ago
- Voice Conversion Challenge 2020 CycleVAE baseline system☆133Updated 3 years ago
- A PyTorch implementation of the EETS: End-to-End Adversarial Text-to-Speech☆126Updated 4 years ago
- Supporting code for "Emotion Recognition in Speech using Cross-Modal Transfer in the Wild"☆101Updated 4 years ago
- AVSpeech downloader☆65Updated 5 years ago
- Re-implementation of the code used in Voice Conversion from Non-parallel Corpora Using Variational Auto-encoder☆145Updated 5 years ago
- Mel cepstral distortion (MCD) computations in Python.☆207Updated 7 years ago
- A Toolkit for ToBI Labeling with Python Data Structures☆24Updated 2 years ago
- Prosody Morph: A Python library for manipulating pitch and duration in an algorithmic way, for resynthesizing speech.☆81Updated last year
- Implementation code of non-parallel sequence-to-sequence VC☆248Updated last year
- This is the official implementation of the paper AGAIN-VC: A One-shot Voice Conversion using Activation Guidance and Adaptive Instance No…☆110Updated 3 years ago