zengchang233 / GMM_baseline
Baseline for speaker recognition in the speech track of the Future Cup (未来杯)
☆48 Updated 5 years ago
Related projects
Alternatives and complementary repositories for GMM_baseline
- dVector-based speaker recognition in Keras ☆87 Updated 3 years ago
- Speaker recognition using Keras ☆36 Updated last year
- A speaker recognition system which uses GMM-UBM for use in an Android application which helps in monitoring patients suffering from Schiz… ☆52 Updated 6 years ago
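- Deep-learning-based speech enhancement and dereverberation ☆87 Updated 9 months ago
- Fundamentals of speech signal processing ☆35 Updated 5 years ago
- Speech feature extraction for machine learning, including FBank and MFCC, with explanations of the principles and step-by-step implementations ☆50 Updated 5 years ago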
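- Experimental comparison of three end-to-end speaker (voiceprint) model frameworks (Deep Speaker, RawNet, GE2E) on three standard public datasets: VCTK, AISHELL1, and VoxCeleb1. ☆22 Updated 4 years ago
- An acoustic model built end to end, using characters as the modeling unit and a DCNN-CTC network architecture. ☆71 Updated 5 years ago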
- This is an implementation of kaldi-plda. ☆15 Updated 6 years ago
- Python code to extract MFCC and FBANK speech features for Kaldi ☆63 Updated 5 years ago
- ☆35 Updated 5 years ago
- Mining effective negative training samples for keyword spotting (PyTorch) ☆57 Updated 4 years ago
- Ideal Ratio Mask (IRM) Estimation based Speech Enhancement using LSTM ☆113 Updated 5 years ago
- [INTERSPEECH 2019] Waiting Update! This project is a demonstration of the paper UNetGAN: A Robust Speech Enhancement Approach in Time Dom… ☆20 Updated 5 years ago
- Simple DNN-based Voice Activity Detection (VAD) using PyTorch ☆39 Updated 4 years ago
- A collection of resources for speech processing algorithms || NEWS: the official VoxCeleb link has been failing recently, so an external download link has been added ☆45 Updated 2 years ago
- Baseline for DCASE 2019 task 4 ☆59 Updated 2 years ago
- Audio-visual voice activity detection using deep learning ☆48 Updated 5 years ago
- A neural network consisting of a CNN and an LSTM for speech enhancement ☆24 Updated 6 years ago
- Implementation of the paper "SNR-Based Progressive Learning of Deep Neural Network for Speech Enhancement." ☆42 Updated 5 years ago
- A state-of-the-art time-domain network for speech separation that also performs well on speech enhancement and music separation ☆42 Updated 5 years ago
- This repository includes four different implementations of the speaker verification task, including GMM_UBM, Ivector, Deep-Speaker, an… ☆32 Updated 6 years ago
- Some useful features for speech processing, such as MFCC, gammatone filterbank, GFCC, spectrum (power spectrum and log-power spectrum), Amplit… ☆121 Updated 4 years ago
- Some scripts for ASVspoof 2017 ☆11 Updated 5 years ago
- Code for DCASE 2020 task 1a and task 1b. ☆85 Updated 2 years ago
- ☆51 Updated 3 years ago
- Youyan (优研) project: speaker recognition ☆8 Updated 9 years ago