ideo / LaughDetection
☆124Updated 5 years ago
Alternatives and similar repositories for LaughDetection:
Users that are interested in LaughDetection are comparing it to the libraries listed below
- Tools for ASR Corpus Generation from Online Video☆139Updated 5 years ago
- Live demo for speech emotion recognition using Keras and Tensorflow models☆39Updated 5 months ago
- speech-aligner is a tool that generates phoneme-level time-alignment annotations from human speech and its corresponding text.☆396Updated 4 years ago
- A script for audio/transcript alignment. Fork of p2fa.☆68Updated 6 years ago
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text☆240Updated 5 years ago
- g2pC: A Context-aware Grapheme-to-Phoneme Conversion module for Chinese☆239Updated 5 years ago
- Sound augmentation using Large-scale audio dataset (Audioset)☆44Updated 3 years ago
- DaCiDian is an open-source Chinese Mandarin lexicon for automatic speech recognition (ASR)☆300Updated 4 years ago
- ☆40Updated 6 years ago
- Train a Deep Learning model to classify audio embeddings on IBM's Deep Learning as a Service (DLaaS) platform - Watson Machine Learning☆99Updated 2 years ago
- A PyTorch implementation for the ZeroSpeech 2019 challenge.☆112Updated 5 years ago
- (Deprecated) WaveNet vocoder☆21Updated 4 years ago
- TTS model based on Transformer.☆57Updated 5 years ago
- Penn Phonetics Lab Forced Aligner Toolkit (P2FA) for Python3☆99Updated 10 months ago
- Supplementary information and code for INTERSPEECH 2018 paper: Singing voice phoneme segmentation by hierarchically inferring syllable an…☆46Updated 6 years ago
- Audio classification with VGGish as feature extractor in TensorFlow☆127Updated 3 years ago
- Implementation of voice conversion system utilizing phonetic posteriorgrams (status: archive)☆81Updated 4 years ago
- mirror of VoxCeleb dataset - a large-scale speaker identification dataset☆69Updated 5 years ago
- Looking to listen at cocktail party☆36Updated last year
- ESPnet Model Zoo☆245Updated last year
- A GitHub repo of the openSMILE feature extraction tool.☆213Updated 3 years ago
- Discriminative Neural Clustering for Speaker Diarisation☆78Updated 2 years ago
- Reproducing Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis (https://arxiv.org/pdf/1803.09…☆61Updated 6 years ago
- A toolkit for speech segmentation based on BIC and neural networks, such as BiLSTM☆122Updated 5 years ago