wq2012 / VoiceIdentityBook
《声纹技术:从核心算法到工程实践》
☆165Updated 2 years ago
Alternatives and similar repositories for VoiceIdentityBook:
Users that are interested in VoiceIdentityBook are comparing it to the libraries listed below
- Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - P…☆197Updated 2 weeks ago
- ☆143Updated 4 years ago
- ☆116Updated 2 years ago
- implementation of rnnoise_16k☆131Updated 3 years ago
- OpenSpeaker is a completely independent and open source speaker recognition project. It provides the entire process of speaker recognitio…☆64Updated 3 years ago
- Kaldi-based goodness of pronunciation (GOP)☆150Updated 4 years ago
- ☆526Updated 3 years ago
- A python package for calculating the PESQ.☆381Updated 2 years ago
- 用于机器学习的语音特征提取,包含FBank和MFCC等,原理讲解和step by step的实现☆52Updated 5 years ago
- Production First and Production Ready End-to-End Text-to-Speech Toolkit☆389Updated 11 months ago
- ☆86Updated 5 months ago
- This repo is to list the references papers of 《Speaker Recognition Based on Deep Learning: An Overview》☆40Updated 3 years ago
- 基于深度学习的声学回声消除基线代码☆140Updated 3 years ago
- ☆123Updated 3 years ago
- This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).☆258Updated 2 years ago
- A unofficial Pytorch implementation of Microsoft's PHASEN☆228Updated last year
- Implementation of paper "DPCRN: Dual-Path Convolution Recurrent Network for Single Channel Speech Enhancement"☆201Updated last year
- You can find the speech algorithms you want here☆799Updated 4 months ago
- The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to pro…☆118Updated 2 years ago
- Kaldi-compatible online fbank extractor without external dependencies☆98Updated last week
- Multi-Scale Temporal Frequency Convolutional Network With Axial Attention for Speech Enhancement☆210Updated 2 years ago
- speech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech an…☆401Updated 5 years ago
- The dataset of Speech Recognition☆413Updated 4 months ago
- ☆106Updated 4 years ago
- ASR教程: https://dataxujing.github.io/ASR-paper/☆24Updated 10 months ago
- A 10000+ hours dataset for Chinese speech recognition☆533Updated last year
- A statistical model-based Voice Activity Detection☆192Updated 6 years ago
- AEC Challenge☆413Updated 11 months ago
- a lightweight speech processing toolkit based on Pytorch and (Py)Kaldi☆339Updated 4 years ago
- Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN☆91Updated 3 years ago