ideo / LaughDetection
☆126Updated 6 years ago
Alternatives and similar repositories for LaughDetection:
Users that are interested in LaughDetection are comparing it to the libraries listed below
- Train a Deep Learning model to classify audio embeddings on IBM's Deep Learning as a Service (DLaaS) platform - Watson Machine Learning☆101Updated 2 years ago
- ☆258Updated 9 months ago
- Tools for ASR Corpus Generation from Online Video☆140Updated 6 years ago
- DaCiDian is an open-sourced chinese mandarin lexicon for automatic speech recognition(ASR)☆301Updated 4 years ago
- ☆40Updated 6 years ago
- Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.☆256Updated 5 years ago
- Implementation of state of the art d-vector approach for speaker verification☆127Updated 7 years ago
- Datasets of A Deep Convolutional Neural Network Based Virtual Elderly Companion Agent.☆35Updated 6 years ago
- A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Waven…☆52Updated 6 years ago
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text☆242Updated 5 years ago
- Looking to listen at cocktail party☆36Updated 2 years ago
- Sound augmentation using Large-scale audio dataset (Audioset)☆44Updated 3 years ago
- this is a treasure-house of speech☆164Updated 6 years ago
- A Python toolbox for speech features extraction☆161Updated 2 years ago
- Audio classification with VGGish as feature extractor in TensorFlow☆128Updated 3 years ago
- speech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech an…☆401Updated 5 years ago
- A Demo of Mandarin/Chinese TTS frontend☆278Updated 3 years ago
- (已过时)WaveNet 声码器☆21Updated 5 years ago
- Code for Yun Wang's PhD Thesis: Polyphonic Sound Event Detection with Weak Labeling☆165Updated 2 years ago
- TTS model based on Transformer.☆57Updated 5 years ago
- CN-Celeb, a large-scale Chinese celebrities dataset published by Center for Speech and Language Technology (CSLT) at Tsinghua University.☆73Updated 5 years ago
- g2pC: A Context-aware Grapheme-to-Phoneme Conversion module for Chinese☆240Updated 5 years ago
- Discriminative Neural Clustering for Speaker Diarisation☆78Updated 3 years ago
- Implementation of voice conversion system utilizing phonetic posteriorgrams (status: archive)☆81Updated 4 years ago
- py-webrtcvad wrapper for trimming speech clips☆48Updated 2 years ago
- style token with tacotron2☆61Updated last year
- Voice Activity Detector☆73Updated 2 years ago
- ☆130Updated 6 years ago
- A github repo of the openSMILE feature extraction tool.☆217Updated 3 years ago
- Live demo for speech emotion recognition using Keras and Tensorflow models☆39Updated 8 months ago