YuzhongHuangCS / SDHumming
Unofficial Repository - Code Collected From Internet - SDHBuildModel - SDFuzzySearch - SDHumming - SDHummingDemo
☆18Updated 10 years ago
Alternatives and similar repositories for SDHumming
Users that are interested in SDHumming are comparing it to the libraries listed below
- A multilingual (97 languages) tool for TTS that automatically recognizes and segments mixed-language text content.☆20Updated last year
- A Kaldi-based iOS speech recognition demo☆28Updated 6 years ago
- DaCiDian is an open-source Mandarin Chinese lexicon for automatic speech recognition (ASR)☆302Updated 5 years ago
- The code for aishell-3 baseline acoustic model☆69Updated 5 years ago
- Official PyTorch code for Deep Audio-Signal Holistic Embeddings☆173Updated 2 months ago
- ☆276Updated 4 years ago
- E2E system with LF-MMI; word N-gram for Mandarin☆166Updated 3 years ago
- Opencpop: A High-Quality Open Source Chinese Popular Song Database for Singing Voice Synthesis☆232Updated last month
- Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation (INTERSPEECH 2020)☆144Updated 3 years ago
- Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.☆260Updated 6 years ago
- PyTorch implementation of JDCNet, a singing voice detection and classification network☆54Updated 2 years ago
- Code and generated sounds for "Conditional Sound Generation Using Neural Discrete Time-Frequency Representation Learning", MLSP 2021☆69Updated 4 years ago
- This repo augments the scripts in CVTE model (http://kaldi-asr.org/models/m2)☆15Updated 6 years ago
- Kaldi-based local speech recognition for iOS (local real-time streaming)☆74Updated 4 years ago
- ITU P.563 code with minor modifications to make it run on Mac☆39Updated 8 years ago
- This repo hosts the code and model of "Separate What You Describe: Language-Queried Audio Source Separation", Interspeech 2022☆145Updated 2 years ago
- Tree visualization of the AudioSet Ontology - https://github.com/audioset/ontology☆18Updated last year
- speech-aligner is a tool that produces phoneme-level time-alignment annotations from human speech and its corresponding text.☆410Updated 5 years ago
- The repo provides information about KeSpeech dataset.☆167Updated 3 years ago
- Chinese Text Normalization and Dataset☆89Updated 3 years ago
- automatic vibrato and portamento detection and analysis tool☆12Updated 3 years ago
- Kaldi-based goodness of pronunciation (GOP)☆156Updated 4 years ago
- Learning a Representation for Cover Song Identification Using Convolutional Neural Network (ICASSP 2020)☆54Updated 2 years ago
- Encoder and Decoder and Attention Based Prosody Prediction☆68Updated 7 years ago
- ☆70Updated 5 years ago
- Forced alignment and Goodness of Pronunciation (GOP) with DNN support. Based on Kaldi.☆235Updated 6 years ago
- ☆61Updated 2 years ago
- ☆76Updated 3 years ago
- Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.☆184Updated 5 years ago
- Paper reproduction: Chinese polyphone disambiguation using POS tags☆21Updated 6 years ago