wq2012 / VoiceIdentityBook
《声纹技术:从核心算法到工程实践》
☆154Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for VoiceIdentityBook
- ☆142Updated 4 years ago
- 基于深度学习的声学回声消除基线代码☆129Updated 3 years ago
- 用于机器学习的语音特征提取,包含FBank和MFCC等,原理讲解和step by step的实现☆50Updated 5 years ago
- Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - P…☆187Updated 3 weeks ago
- implementation of rnnoise_16k☆123Updated 3 years ago
- A python package for calculating the PESQ.☆357Updated last year
- ☆85Updated 3 years ago
- 利用webRTC对语音进行处理,实现VAD和降噪处理☆48Updated 6 years ago
- 主要参考李宏毅老师2020年人类语言处理课程资料整理,包括代码和ppt☆33Updated 3 years ago
- The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to pro…☆114Updated 2 years ago
- This repo is to list the references papers of 《Speaker Recognition Based on Deep Learning: An Overview》☆37Updated 3 years ago
- Kaldi-based goodness of pronunciation (GOP)☆147Updated 3 years ago
- ☆105Updated last year
- This repo summarizes the courses and materials for speech signal processing. You are kindly invited to pull requests.☆90Updated 4 years ago
- A statistical model-based Voice Activity Detection☆190Updated 5 years ago
- Implementation of the paper "Spoken Language Recognition using X-vectors" in Pytorch☆105Updated 4 years ago
- OpenSpeaker is a completely independent and open source speaker recognition project. It provides the entire process of speaker recognitio…☆61Updated 2 years ago
- ☆69Updated 3 years ago
- ☆107Updated 3 years ago
- ☆119Updated 3 years ago
- Implementation of paper "DPCRN: Dual-Path Convolution Recurrent Network for Single Channel Speech Enhancement"☆188Updated 7 months ago
- simple dnn based vad☆70Updated 5 years ago
- this is a treasure-house of speech☆164Updated 6 years ago
- This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).☆245Updated 2 years ago
- Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN☆89Updated 3 years ago
- AEC Challenge☆386Updated 5 months ago
- Some useful features of speech process, such as MFCC, gammatone filterbank, GFCC, spectrum(power spectrum and log-power spectrum), Amplit…☆121Updated 4 years ago
- Audio Split 基于双门限法的语音端点检测及语音分割☆127Updated 4 years ago
- Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model based on BLSTM. (Interspeech, 2018, with Travel Grants)☆88Updated 5 years ago
- Acoustic Echo Cancellation with Nerual Kalman Filtering☆238Updated last year