Kevinnan-teen / Speaker-RecognitionLinks
说话人识别(声纹识别)算法的Python实现。包括GMM(已完成)、GMM-UBM、ivector、基于深度学习的声纹识别(self-attention已完成)。
☆101Updated 2 years ago
Alternatives and similar repositories for Speaker-Recognition
Users that are interested in Speaker-Recognition are comparing it to the libraries listed below
Sorting:
- Audio Split 基于双门限法的语音端点检测及语音分割☆134Updated 5 years ago
- 使用Tensorflow实现声纹识别☆320Updated last year
- 基于Pytorch实现的语音情感识别☆227Updated 4 months ago
- 基于Tensorflow实现声音分类,博客地址:☆102Updated 5 years ago
- 使用python进行语音识别☆162Updated 3 years ago
- The Pytorch implementation of sound classification supports EcapaTdnn, PANNS, TDNN, Res2Net, ResNetSE and other models, as well as a vari…☆534Updated 4 months ago
- 语音信号处理试验教程,Python代码☆336Updated 3 years ago
- 说话人特征(声纹)提取工具,基于VGG-SR预训练模型。☆37Updated 5 years ago
- 用CASIA database数据集做的,做的语音情感识别和语音识人的练习☆70Updated 2 years ago
- 城市声音分类 Urban Sound Classification with TensorFlow Keras - MLP, RNN, CNN☆94Updated 6 years ago
- Acoustic feature extraction using Librosa library and openSMILE toolkit.使用Librosa音频处理库和openSMILE工具包,进行简单的声学特征提取☆205Updated 5 years ago
- 基于Kersa实现的声纹识别模型☆147Updated 11 months ago
- 基于dVector的说话人识别keras☆90Updated 4 years ago
- 基于卷积神经网络的语音识别声学模型的研究☆176Updated 6 years ago
- 语音感情识别☆38Updated 4 months ago
- Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)☆732Updated last year
- 这个项目将 RAVDESS 数据集切割成 1s 短语音,利用 openSMILE+CNN 进行训练,目标是将短语音分类到四种情感中,分别是:开心(happy)、悲伤(sad)、生气(angry)和中性(neutral)。最后准确率达到 76% 左右。☆59Updated 4 years ago
- 基于Flask Web的中文自动语音识别演示系统,包含语音识别、语音合成、声纹识别之说话人识别。☆172Updated last year
- 基于python的hmm-gmm声学模型☆29Updated 6 years ago
- 基于深度学习的语音增强、去混响☆96Updated last year
- 语音识别 MFCCs特征处理 cnn神经网络☆100Updated 6 years ago
- Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN☆95Updated 3 years ago
- 利用Python+TensorFlow实现语音识别☆48Updated 6 years ago
- An Open Source Tools for Speaker Recognition☆625Updated last year
- 本项目使用了EcapaTdnn、ResNetSE、ERes2Net、CAM++等多种先进的声纹识别模型,同时本项目也支持了MelSpectrogram、Spectrogram、MFCC、Fbank等多种数据预处理方法☆285Updated 3 months ago
- 基于PaddlePaddle实现的音频分类,支持EcapaTdnn、PANNS、TDNN、Res2Net、ResNetSE等各种模型,还有多种预处理方法☆97Updated 4 months ago
- The dataset of Speech Recognition☆426Updated 8 months ago
- Keras implementation of ‘’Deep Speaker: an End-to-End Neural Speaker Embedding System‘’ (speaker recognition)☆250Updated 5 years ago
- 用于机器学习的语音特征提取,包含FBank和MFCC等,原理讲解和step by step的实现☆53Updated 6 years ago
- 基于傅里叶变换的降噪与基于深度学习的语音识别的多功能系统☆14Updated 2 years ago