ucbvislab / p2fa-vislabLinks
A script for audio/transcript alignment. Fork of p2fa.
☆69Updated 7 years ago
Alternatives and similar repositories for p2fa-vislab
Users that are interested in p2fa-vislab are comparing it to the libraries listed below
Sorting:
- SailAlign is an open-source software toolkit for robust long speech-text alignment implementing an adaptive, iterative speech recognition…☆99Updated 3 years ago
- Deep understanding and modelling of the hierarchical structure of prosody☆24Updated 6 years ago
- Penn Phonetics Lab Forced Aligner Toolkit (P2FA) for Python3☆107Updated last year
- ☆59Updated 6 years ago
- Live demo for speech emotion recognition using Keras and Tensorflow models☆39Updated last year
- Phoneme Recognition using RecNet☆97Updated 9 years ago
- maracas is a library for corrupting audio files with additive and convolutive noise.☆72Updated 8 years ago
- MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.☆80Updated 6 years ago
- Vocode spectrograms to audio with generative adversarial networks☆63Updated 6 years ago
- A Collection of Speech Corpus for ASR and TTS☆113Updated 8 years ago
- Python interface for forced audio alignment using HTK and SoX☆350Updated 5 years ago
- A repository for maintaing the fave-align and fave-extract toolkits☆118Updated last year
- Cross-lingual Voice Conversion☆97Updated 8 years ago
- DeepSpeech based forced alignment tool☆239Updated 5 years ago
- All you need to get started for the Zero Speech Challenge 2017☆47Updated 6 years ago
- Tensorflow Implementation of Expressive Tacotron☆196Updated 7 years ago
- Adapting your own Language Model for Kaldi☆63Updated 7 years ago
- A deep learning framework for Speech-Music discrimination of continuous audio streams☆68Updated 7 years ago
- Reproducing Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis (https://arxiv.org/pdf/1803.09…☆61Updated 7 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks☆65Updated 5 years ago
- Long audio alignment using Kaldi☆23Updated 4 years ago
- HTK features in Python☆73Updated 3 months ago
- Automatic prosodic annotation tool written in Java.☆64Updated 6 years ago
- Util code, issues, discussions☆29Updated 7 years ago
- Prososdy Morph: A python library for manipulating pitch and duration in an algorithmic way, for resynthesizing speech.☆84Updated 3 weeks ago
- Human Voice Wave Samples☆82Updated 11 years ago
- ☆81Updated 8 years ago
- 24-hour Automatic Speech Recognition☆27Updated 4 years ago
- Multilingual Grapheme to Phoneme☆51Updated 9 years ago
- Mapping features using Deep Neural Networks (DNNs) with application to Voice Conversion (VC). The implementations are on top of Theano Py…☆32Updated 7 years ago