YuzhongHuangCS / SDHumming
Unofficial Repository - Code Collected From Internet - SDHBuildModel - SDFuzzySearch - SDHumming - SDHummingDemo
☆18Updated 9 years ago
Alternatives and similar repositories for SDHumming
Users that are interested in SDHumming are comparing it to the libraries listed below
- A Kaldi-based iOS speech recognition demo☆28Updated 6 years ago
- Kaldi-based iOS on-device speech recognition (local real-time streaming)☆74Updated 4 years ago
- DaCiDian is an open-source Chinese Mandarin lexicon for automatic speech recognition (ASR)☆301Updated 5 years ago
- Recognize Audio Emotion.☆89Updated 5 years ago
- A multilingual (97 languages) tool for TTS that automatically identifies and segments mixed-language text.☆19Updated last year
- Explore Text-To-Speech☆25Updated 7 years ago
- The code for aishell-3 baseline acoustic model☆69Updated 4 years ago
- A little useful toolbox for python.☆77Updated 5 years ago
- Kaldi-based goodness of pronunciation (GOP)☆154Updated 4 years ago
- automatic vibrato and portamento detection and analysis tool☆12Updated 3 years ago
- Forced alignment and Goodness of Pronunciation (GOP) with DNN support. Based on Kaldi.☆233Updated 6 years ago
- Phone-level evaluation of L2 speakers (GOP algorithm)☆27Updated 8 years ago
- Learning a Representation for Cover Song Identification Using Convolutional Neural Network. ICASSP 2020☆54Updated 2 years ago
- Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.☆260Updated 6 years ago
- Connectionist Temporal Classification (CTC) Automatic Speech Recognition☆296Updated 7 years ago
- ITU P.563 code with minor modifications to make it run on Mac☆39Updated 8 years ago
- The MOS system combines components from DNSMOS, NISQA, MOSSSL, and SIGMOS, using the librosa library to process audio waveforms.☆28Updated last year
- Integrates WebRTC's VAD to split audio files☆344Updated 5 years ago
- ☆70Updated 4 years ago
- Encoder and Decoder and Attention Based Prosody Prediction☆68Updated 7 years ago
- This repo augments the scripts in CVTE model (http://kaldi-asr.org/models/m2)☆15Updated 6 years ago
- PyTorch implementation of JDCNet, a singing voice detection and classification network☆53Updated 2 years ago
- The TensorFlow version of multi-speaker TTS training with feedback constraint☆40Updated 5 years ago
- CN-Celeb, a large-scale Chinese celebrities dataset published by Center for Speech and Language Technology (CSLT) at Tsinghua University.☆74Updated 6 years ago
- Source code for 'Music source separation using stacked hourglass networks', ISMIR 2018☆45Updated 7 years ago
- A Convolutional Neural Network based Voice Activity Detector for Smartphones☆70Updated 6 years ago
- Deepmind's Tacotron-2 Tensorflow implementation☆163Updated 5 years ago
- N/A☆180Updated 3 years ago
- This repo hosts the code and model of "Separate What You Describe: Language-Queried Audio Source Separation", Interspeech 2022☆145Updated 2 years ago
- Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model based on BLSTM. (Interspeech, 2018, with Travel Grants)☆92Updated 6 years ago