cnlinxi / speech_emotion
Detect emotion from audio
☆13Updated 6 years ago
Alternatives and similar repositories for speech_emotion
Users that are interested in speech_emotion are comparing it to the libraries listed below
- An extension of the Kaldi speech recognition software that allows decoding speech with hybrid word and phoneme graphs.…☆11Updated 5 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Updated 5 years ago
- Various algorithms for voice activity detection☆22Updated 8 years ago
- ChiNese Text Normalization (CNTN) tool for Text-to-speech system☆35Updated 7 years ago
- Region proposal network based small-footprint keyword spotting (Pytorch)☆54Updated last year
- ☆33Updated 3 years ago
- Coordinate-wise meta-learner for speaker adaptation of ASR models.☆20Updated 5 years ago
- Filtering and Noise Adding Tool☆29Updated 3 years ago
- Python code to extract MFCC and FBANK speech features for Kaldi☆65Updated 6 years ago
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning.☆22Updated 6 years ago
- Implementation of "EFFICIENT KEYWORD SPOTTING USING DILATED CONVOLUTIONS AND GATING"☆36Updated 5 years ago
- Building an NN-CTC acoustic model with phoneme-based modeling☆15Updated 6 years ago
- ☆22Updated 5 years ago
- SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification☆30Updated 2 years ago
- Mining effective negative training samples for keyword spotting (PyTorch)☆61Updated 5 years ago
- A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.☆11Updated 6 years ago
- ☆20Updated 5 years ago
- streaming attention networks for end-to-end automatic speech recognition☆55Updated 5 years ago
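- A collection of resources on speech technologies: speech recognition, speech front-end processing, speech synthesis, voice conversion, and more☆22Updated 5 years ago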
- Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files☆52Updated last month
- ☆13Updated 4 years ago
- Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion☆39Updated 2 years ago
- Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset.☆25Updated 6 years ago
- HMM, CTC, RNN-Transducer, forward-backward algorithm☆21Updated last year
- This repo contains conv-tasnet for basis-melgan. If you want to get code of basis-melgan, please refer to FastVocoder.☆20Updated 3 years ago
- Text to Speech Synthesis based on controllable latent representation☆14Updated 5 years ago
- ESPnet-TTS Audio Sample HP☆21Updated 5 years ago
- PyTorch implementation of the Deep Griffin-Lim Iteration paper (https://arxiv.org/abs/1903.03971)☆38Updated 5 years ago
- Core code for my ICASSP 2018 paper☆53Updated 6 years ago
- Download and create a tfreader for the audioset dataset☆16Updated 5 years ago