wq2012 / VoiceIdentityBook
Voice Identity Techniques: From Core Algorithms to Engineering Practice (《声纹技术:从核心算法到工程实践》)
☆170Updated 2 years ago
Alternatives and similar repositories for VoiceIdentityBook
Users who are interested in VoiceIdentityBook are comparing it to the libraries listed below.
- OpenSpeaker is a completely independent and open source speaker recognition project. It provides the entire process of speaker recognitio…☆64Updated 3 years ago
- A python package for calculating the PESQ.☆388Updated 3 weeks ago
- ☆144Updated 5 years ago
- ☆536Updated 4 years ago
- This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).☆267Updated 3 years ago
- Kaldi-based goodness of pronunciation (GOP)☆151Updated 4 years ago
- Speech feature extraction for machine learning, including FBank and MFCC, with explanations of the principles and step-by-step implementations☆53Updated 6 years ago
- Chinese keyword spotting model using LSTM RNN☆173Updated 6 years ago
- Open-source tools for speaker recognition☆621Updated last year
- simple dnn based vad☆70Updated 6 years ago
- Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - P…☆203Updated this week
- Integrates the WebRTC VAD to split audio files☆342Updated 4 years ago
- You can find the speech algorithms you want here☆823Updated 2 weeks ago
- AEC Challenge☆430Updated last year
- implementation of rnnoise_16k☆136Updated 3 years ago
- A release version for https://github.com/athena-team/athena☆127Updated 2 years ago
- Kaldi model converter to ONNX☆244Updated 2 years ago
- Forced alignment and Goodness of Pronunciation (GOP) with DNN support. Based on Kaldi.☆232Updated 6 years ago
- Speech processing with WebRTC, implementing VAD and noise reduction☆51Updated 6 years ago
- Baseline code for deep-learning-based acoustic echo cancellation☆151Updated 4 years ago
- Production First and Production Ready End-to-End Text-to-Speech Toolkit☆392Updated last year
- ☆88Updated 8 months ago
- this is a treasure-house of speech☆164Updated 7 years ago
- Python bindings of WebRTC Audio Processing☆193Updated 3 months ago
- Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.☆229Updated 4 years ago
- A statistical model-based Voice Activity Detection☆192Updated 6 years ago
- speech-aligner is a tool that generates phoneme-level time-alignment annotations from human speech and its corresponding text.☆407Updated 5 years ago
- A book about Text-to-Speech (TTS) in Chinese.☆606Updated 3 years ago
- ASR tutorial: https://dataxujing.github.io/ASR-paper/☆24Updated last year
- Materials compiled mainly from Prof. Hung-yi Lee's 2020 Human Language Processing course, including code and slides☆34Updated 4 years ago