wq2012 / VoiceIdentityBook
Voiceprint Technology: From Core Algorithms to Engineering Practice (《声纹技术:从核心算法到工程实践》)
☆158 Updated 2 years ago
Alternatives and similar repositories for VoiceIdentityBook:
Users who are interested in VoiceIdentityBook are comparing it to the repositories listed below.
- ☆142 Updated 4 years ago
- Open-source tools for speaker recognition ☆611 Updated 6 months ago
- Kaldi-based goodness of pronunciation (GOP) ☆147 Updated 4 years ago
- The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to pro… ☆117 Updated 2 years ago
- Speaker recognition based on d-vectors, implemented in Keras ☆87 Updated 4 years ago
- Baseline code for deep-learning-based acoustic echo cancellation ☆132 Updated 3 years ago
- A Python package for calculating the PESQ. ☆369 Updated last year
- Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - P… ☆194 Updated 2 weeks ago
- ☆106 Updated 3 years ago
- A treasure house of speech resources ☆164 Updated 6 years ago
- ☆112 Updated last year
- A summary of speech data augmentation algorithms ☆68 Updated 4 years ago
- A librosa-style STFT/Fbank/MFCC feature extraction written in PyTorch using 1D convolutions. ☆74 Updated 2 years ago
- Implementation of the paper "Spoken Language Recognition using X-vectors" in PyTorch ☆106 Updated 4 years ago
- Towards hot directions in industrial end-to-end speech recognition ☆326 Updated 3 years ago
- Chinese keyword spotting model using LSTM RNN ☆172 Updated 6 years ago
- An unofficial PyTorch implementation of Microsoft's PHASEN ☆227 Updated 10 months ago
- Simple DNN-based VAD ☆70 Updated 6 years ago
- Implementation of the paper "DPCRN: Dual-Path Convolution Recurrent Network for Single Channel Speech Enhancement" ☆194 Updated 10 months ago
- Statistical model-based voice activity detection ☆190 Updated 6 years ago
- Speech feature extraction for machine learning, including FBank and MFCC, with explanations of the principles and step-by-step implementations ☆52 Updated 5 years ago
- ASR tutorial: https://dataxujing.github.io/ASR-paper/ ☆24 Updated 7 months ago
- Speech processing with WebRTC, implementing VAD and noise reduction ☆49 Updated 6 years ago
- ASR for Chinese Mandarin ☆75 Updated 6 years ago
- A lightweight speech processing toolkit based on PyTorch and (Py)Kaldi ☆338 Updated 4 years ago
- Neural speaker recognition/verification system based on Kaldi and TensorFlow ☆32 Updated 4 years ago
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196 ☆310 Updated 4 years ago
- ☆411 Updated last year
- A CRF-based ASR Toolkit ☆328 Updated 6 months ago
- speech-aligner is a tool that generates phoneme-level time-alignment annotations from human speech and its corresponding text… ☆396 Updated 4 years ago