wq2012 / VoiceIdentityBookLinks
"Voice Identity Techniques: From Core Algorithms to Engineering Practice" (《声纹技术:从核心算法到工程实践》)
☆166 Updated 2 years ago
Alternatives and similar repositories for VoiceIdentityBook
Users who are interested in VoiceIdentityBook are comparing it to the libraries listed below
- ☆143 Updated 4 years ago
- Kaldi-based goodness of pronunciation (GOP) ☆151 Updated 4 years ago
- Speech feature extraction for machine learning, including FBank and MFCC, with an explanation of the principles and a step-by-step implementation ☆52 Updated 6 years ago
- Kaldi model converter to ONNX ☆244 Updated 2 years ago
- You can find the speech algorithms you want here ☆806 Updated 5 months ago
- This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection). ☆259 Updated 3 years ago
- A Python package for calculating the PESQ. ☆381 Updated 2 years ago
- CN-Celeb, a large-scale Chinese celebrities dataset published by the Center for Speech and Language Technology (CSLT) at Tsinghua University. ☆74 Updated 5 years ago
- Open-source tools for speaker recognition ☆616 Updated 9 months ago
- A release version for https://github.com/athena-team/athena ☆127 Updated 2 years ago
- ☆529 Updated 3 years ago
- Implementation of paper "DPCRN: Dual-Path Convolution Recurrent Network for Single Channel Speech Enhancement" ☆202 Updated last year
- A lightweight speech processing toolkit based on PyTorch and (Py)Kaldi ☆340 Updated 4 years ago
- speech-aligner is a tool that generates phoneme-level time-alignment annotations from human speech and its corresponding text. ☆400 Updated 5 years ago
- The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to pro… ☆119 Updated 2 years ago
- Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - P… ☆199 Updated last month
- This repo lists the reference papers of "Speaker Recognition Based on Deep Learning: An Overview" ☆40 Updated 3 years ago
- A summary of speech data augmentation algorithms ☆68 Updated 4 years ago
- ☆86 Updated 5 months ago
- ☆118 Updated 2 years ago
- An unofficial PyTorch implementation of Microsoft's PHASEN ☆229 Updated last year
- OpenSpeaker is a completely independent and open source speaker recognition project. It provides the entire process of speaker recognitio… ☆64 Updated 3 years ago
- Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model based on BLSTM. (Interspeech, 2018, with Travel Grants) ☆93 Updated 5 years ago
- ☆125 Updated 3 years ago
- A statistical model-based Voice Activity Detection ☆192 Updated 6 years ago
- ☆61 Updated 2 years ago
- Simple DNN-based VAD ☆70 Updated 6 years ago
- Forced alignment and Goodness of Pronunciation (GOP) with DNN support. Based on Kaldi. ☆230 Updated 6 years ago
- Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN ☆91 Updated 3 years ago
- Voice activity detection (VAD) papers and code (from 198*~) and their classification. ☆99 Updated last week