Gammatone feature for robust speech recognition
☆14Aug 1, 2016Updated 9 years ago
Alternatives and similar repositories for Gfcc
Users that are interested in Gfcc are comparing it to the libraries listed below
Sorting:
- CNN learns feature mapping between corrupted and clean speech☆12Aug 14, 2017Updated 8 years ago
- 迁移学习☆18Jul 21, 2018Updated 7 years ago
- repository for paper "Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis"☆18Jun 17, 2022Updated 3 years ago
- Speaker recognition and verification with deep learning☆13Mar 7, 2017Updated 9 years ago
- some scripts for asvspoof2017☆11Dec 27, 2018Updated 7 years ago
- A neural network consist of cnn and lstm for speech enhancement☆25Aug 2, 2018Updated 7 years ago
- ☆12Sep 2, 2016Updated 9 years ago
- Some useful features of speech process, such as MFCC, gammatone filterbank, GFCC, spectrum(power spectrum and log-power spectrum), Amplit…☆129Aug 12, 2020Updated 5 years ago
- This is part of code of a research on speech synthesizing for a low-resourced language: Gan, a Chinese dialect spoken primarily in Jiangx…☆17Sep 5, 2016Updated 9 years ago
- Bias Tests for Voice Technologies (bt4vt)☆11Jun 16, 2024Updated last year
- 收集、分享日常学习使用到的书籍☆18Dec 4, 2019Updated 6 years ago
- This repository contains the implementation of 《基于深度堆叠卷积神经网络的图像融合》☆10Oct 11, 2019Updated 6 years ago
- Feature extraction for accented-speech or pathological speech☆18Apr 2, 2019Updated 6 years ago
- Optimizing speaker verification and spoofing countermeasure systems together with REINFORCE☆13Mar 31, 2021Updated 4 years ago
- Speech recognition using Linear Predictive Cepstral Coefficients and Dynamic Time Wrapping algorithm.☆15Feb 19, 2014Updated 12 years ago
- A bunch of experiments using Bark and Mel scales, wavelets and paraconsistent feature engineering in order to find the best methods to cl…☆13Aug 16, 2023Updated 2 years ago
- Calculate MFCC/Fbank feature for wav files☆15Nov 21, 2017Updated 8 years ago
- Clone of the mp3gain sources from svn on sourceforge (http://mp3gain.sourceforge.net/)☆11Jan 3, 2013Updated 13 years ago
- CNN to recognize speaker on a spoken numbers dataset☆18Jan 23, 2017Updated 9 years ago
- Code for phonetically classifying TIMIT using TensorFlow☆18Jul 1, 2016Updated 9 years ago
- Official Implementation of VoxTracer (MM' 23)☆11Oct 27, 2023Updated 2 years ago
- Illustrating EM for GMMs and HMMs☆12May 9, 2020Updated 5 years ago
- Robust Speech Recognition Using Generative Adversarial Networks (GAN)☆59Nov 25, 2019Updated 6 years ago
- Official PyTorch implementation of "t-EER: Parameter-Free Tandem Evaluation Metric of Countermeasures and Biometric Comparators"☆14Sep 25, 2023Updated 2 years ago
- frameworks_base for Geeksphone Peak and Keon☆12Jan 13, 2015Updated 11 years ago
- PyTorch Implementation of [WMCodec: End-to-End Neural Speech Codec with Deep Watermarking for Authenticity Verification](https://arxiv.or…☆17Jul 31, 2025Updated 7 months ago
- ☆12May 5, 2017Updated 8 years ago
- Develop speaker recognition model based on i-vector using TIMIT database☆16Jul 4, 2019Updated 6 years ago
- Tensor2tensor experiment with SpecAugment☆46May 13, 2019Updated 6 years ago
- TraceableSpeech: Towards Proactively Traceable Text-to-Speech with Watermarking☆21Apr 18, 2025Updated 11 months ago
- A minimum unofficial implementation of the "A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement" (CRN) using PyTorc…☆346Sep 5, 2020Updated 5 years ago
- Finding the most similar tone/color in a large collection of audio. 在一大堆音频中寻找最相似的音色。☆13Jun 17, 2024Updated last year
- wavenet vocoder using tensorflow☆26Feb 18, 2018Updated 8 years ago
- ☆24Apr 13, 2018Updated 7 years ago
- Augmenting Room Impulse Response☆43Sep 15, 2023Updated 2 years ago
- ☆18Jan 10, 2024Updated 2 years ago
- gypified libfaad C library☆15Apr 12, 2013Updated 12 years ago
- Echo aware source separation☆13May 29, 2018Updated 7 years ago
- Materials for "Multimedia Deepfake Detection" Tutorial @ ICME 2024☆17Aug 26, 2024Updated last year