ideo / LaughDetection
☆126Updated 6 years ago
Alternatives and similar repositories for LaughDetection:
Users that are interested in LaughDetection are comparing it to the libraries listed below
- Penn Phonetics Lab Forced Aligner Toolkit (P2FA) for Python3☆101Updated last year
- A script for audio/transcript alignment. Fork of p2fa.☆68Updated 7 years ago
- Live demo for speech emotion recognition using Keras and Tensorflow models☆39Updated 8 months ago
- Reproducing Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis (https://arxiv.org/pdf/1803.09…☆61Updated 6 years ago
- An Attention Based Open-Source End to End Speech Synthesis Framework, No CNN, No RNN, No MFCC!!!☆86Updated 4 years ago
- A github repo of the openSMILE feature extraction tool.☆217Updated 3 years ago
- Datasets of A Deep Convolutional Neural Network Based Virtual Elderly Companion Agent.☆35Updated 6 years ago
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text☆241Updated 5 years ago
- ☆40Updated 6 years ago
- TTS model based on Transformer.☆57Updated 5 years ago
- Cross-lingual Voice Conversion☆97Updated 7 years ago
- Sound augmentation using Large-scale audio dataset (Audioset)☆44Updated 3 years ago
- Google Summer of Code 2018 Project: Automatic Speech Recognition for Speech-to-Text on Chinese☆115Updated 6 years ago
- style token with tacotron2☆61Updated last year
- A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Waven…☆52Updated 6 years ago
- A Pytorch implementation for the ZeroSpeech 2019 challenge.☆112Updated 5 years ago
- maracas is a library for corrupting audio files with additive and convolutive noise.☆72Updated 7 years ago
- Train a Deep Learning model to classify audio embeddings on IBM's Deep Learning as a Service (DLaaS) platform - Watson Machine Learning☆101Updated 2 years ago
- Phone-level evaluation of L2 speakers (GOP algorithm)☆26Updated 8 years ago
- (Deprecated) WaveNet vocoder☆21Updated 5 years ago
- Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow☆128Updated 3 years ago
- A program for automatic speaker identification using deep learning techniques.☆84Updated 8 years ago
- DaCiDian is an open-source Mandarin Chinese lexicon for automatic speech recognition (ASR)☆301Updated 4 years ago
- g2pC: A Context-aware Grapheme-to-Phoneme Conversion module for Chinese☆240Updated 5 years ago
- Speech Recognition model based off of FAIR research paper built using Pytorch.☆83Updated 6 years ago
- Paper: https://arxiv.org/abs/1702.02285☆63Updated 6 years ago
- mixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras☆69Updated 7 years ago
- A Demo of Mandarin/Chinese TTS frontend☆278Updated 2 years ago
- speech-aligner is a tool that generates phoneme-level time-alignment annotations from human speech and its corresponding text.☆398Updated 4 years ago
- Korean Emotional End-to-End Neural Speech synthesizer, ML4audio, NIPS2017☆72Updated 5 years ago