ideo / LaughDetectionLinks
☆126Updated 6 years ago
Alternatives and similar repositories for LaughDetection
Users that are interested in LaughDetection are comparing it to the libraries listed below
Sorting:
- Tools for ASR Corpus Generation from Online Video☆140Updated 6 years ago
- A github repo of the openSMILE feature extraction tool.☆219Updated 3 years ago
- Phoneme Recognition using RecNet☆96Updated 8 years ago
- A python module that convert chinese written string to read string. 一个python包:将中文书面字符串转换为口语字符串。☆122Updated 5 years ago
- A Collection of Speech Corpus for ASR and TTS☆114Updated 8 years ago
- Datasets of A Deep Convolutional Neural Network Based Virtual Elderly Companion Agent.☆37Updated 7 years ago
- this is a treasure-house of speech☆166Updated 7 years ago
- Sound augmentation using Large-scale audio dataset (Audioset)☆45Updated 4 years ago
- Supplementary information and code for INTERSPEECH 2018 paper: Singing voice phoneme segmentation by hierarchically inferring syllable an…☆46Updated 7 years ago
- An Attention Based Open-Source End to End Speech Synthesis Framework, No CNN, No RNN, No MFCC!!!☆85Updated 4 years ago
- A toolkit to implement segmentation on speech based on BIC and nerual network, such as BiLSTM☆123Updated 6 years ago
- Implementation of voice conversion system utilizing phonetic posteriorgrams (status: archive)☆81Updated 5 years ago
- Speaker diarization scripts, based on AaltoASR☆190Updated 6 years ago
- ☆40Updated 7 years ago
- speech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech an…☆409Updated 5 years ago
- Implementation of state of the art d-vector approach for speaker verification☆127Updated 7 years ago
- ASR for Chinese Mandarin☆76Updated 7 years ago
- g2pC: A Context-aware Grapheme-to-Phoneme Conversion module for Chinese☆242Updated 6 years ago
- DaCiDian is an open-sourced chinese mandarin lexicon for automatic speech recognition(ASR)☆303Updated 5 years ago
- Train a Deep Learning model to classify audio embeddings on IBM's Deep Learning as a Service (DLaaS) platform - Watson Machine Learning☆102Updated this week
- Phone-level evaluation of L2 speakers (GOP algorithm)☆27Updated 8 years ago
- Deep neural networks for getting text-independent speaker embedding written in TensorFlow☆310Updated 6 years ago
- A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Waven…☆52Updated 6 years ago
- TTS model based on Transformer.☆58Updated 6 years ago
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text☆244Updated 5 years ago
- Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments☆110Updated last year
- ☆58Updated 6 years ago
- Voice Activity Detector☆74Updated 2 years ago
- Tensorflow Implementation of Expressive Tacotron☆196Updated 6 years ago
- speech-to-text in pytorch☆83Updated 6 years ago