jqi41/Gfcc

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jqi41/Gfcc)

jqi41 / Gfcc

Gammatone feature for robust speech recognition

☆14

Alternatives and similar repositories for Gfcc

Users that are interested in Gfcc are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

weedwind / CNN_denoise
View on GitHub
CNN learns feature mapping between corrupted and clean speech
☆12Aug 14, 2017Updated 8 years ago
miguelki / Emotion_Recognition
View on GitHub
AAU VGIS9 semester project : emotion recognition based on facial features and voice
☆14Jan 12, 2013Updated 13 years ago
MansteinLiliang / Transfer-Learning
View on GitHub
迁移学习
☆18Jul 21, 2018Updated 8 years ago
mispchallenge / MISP2021-AVSR
View on GitHub
repository for paper "Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis"
☆18Jun 17, 2022Updated 4 years ago
ododoyo / EHNet
View on GitHub
A neural network consist of cnn and lstm for speech enhancement
☆25Aug 2, 2018Updated 7 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
johnkorn / speaker_recognition
View on GitHub
Speaker recognition and verification with deep learning
☆13Mar 7, 2017Updated 9 years ago
rosrad / asvspoof2017
View on GitHub
some scripts for asvspoof2017
☆11Dec 27, 2018Updated 7 years ago
Seratna / CNN-Speech-Recognition
View on GitHub
☆12Sep 2, 2016Updated 9 years ago
yishuihanhan / myBooks
View on GitHub
收集、分享日常学习使用到的书籍
☆18Dec 4, 2019Updated 6 years ago
fwkz / lpcc-speech-recognition
View on GitHub
Speech recognition using Linear Predictive Cepstral Coefficients and Dynamic Time Wrapping algorithm.
☆15Feb 19, 2014Updated 12 years ago
ensismoebius / voiceSpoofingDetectionWavelet
View on GitHub
A bunch of experiments using Bark and Mel scales, wavelets and paraconsistent feature engineering in order to find the best methods to cl…
☆12Aug 16, 2023Updated 2 years ago
justinjohn0306 / TTS-TT2
View on GitHub
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
☆11Mar 26, 2023Updated 3 years ago
ZhihaoDU / speech_feature_extractor
View on GitHub
Some useful features of speech process, such as MFCC, gammatone filterbank, GFCC, spectrum(power spectrum and log-power spectrum), Amplit…
☆129Aug 12, 2020Updated 5 years ago
xinyal / Gan-Speech-Synthesis-Research
View on GitHub
This is part of code of a research on speech synthesizing for a low-resourced language: Gan, a Chinese dialect spoken primarily in Jiangx…
☆17Sep 5, 2016Updated 9 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
wiebket / bt4vt
View on GitHub
Bias Tests for Voice Technologies (bt4vt)
☆11Jun 16, 2024Updated 2 years ago
Miffyli / asv-cm-reinforce
View on GitHub
Optimizing speaker verification and spoofing countermeasure systems together with REINFORCE
☆13Mar 31, 2021Updated 5 years ago
dwgnr / speech-conversion
View on GitHub
Whisper to Normal Speech Conversion with SC-MelGAN and SC-VQ-VAE
☆15Dec 3, 2022Updated 3 years ago
fishWangY / image_fusion
View on GitHub
This repository contains the implementation of 《基于深度堆叠卷积神经网络的图像融合》
☆10Oct 11, 2019Updated 6 years ago
adam2go / mfcc
View on GitHub
Calculate MFCC/Fbank feature for wav files
☆15Nov 21, 2017Updated 8 years ago
Nishanksingla / Caffe-Speaker-Recognition
View on GitHub
CNN to recognize speaker on a spoken numbers dataset
☆18Jan 23, 2017Updated 9 years ago
TimovNiedek / timit_tf
View on GitHub
Code for phonetically classifying TIMIT using TensorFlow
☆17Jul 1, 2016Updated 10 years ago
hongchengzhu / VoxTracer
View on GitHub
Official Implementation of VoxTracer (MM' 23)
☆12Oct 27, 2023Updated 2 years ago
maelfabien / EM_GMM_HMM
View on GitHub
Illustrating EM for GMMs and HMMs
☆12May 9, 2020Updated 6 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
elfchief / mp3gain
View on GitHub
Clone of the mp3gain sources from svn on sourceforge (http://mp3gain.sourceforge.net/)
☆11Jan 3, 2013Updated 13 years ago
wangkenpu / rsrgan
View on GitHub
Robust Speech Recognition Using Generative Adversarial Networks (GAN)
☆59Nov 25, 2019Updated 6 years ago
andywag / NeuralHDL
View on GitHub
☆17Jul 21, 2017Updated 9 years ago
TakHemlata / T-EER
View on GitHub
Official PyTorch implementation of "t-EER: Parameter-Free Tandem Evaluation Metric of Countermeasures and Biometric Comparators"
☆14Sep 25, 2023Updated 2 years ago
changjenyin / DNN_HMM_RNN_speech
View on GitHub
"Automated Speech Recognition System" in Machine Learning and Having it Deep and Structured, Spring 2015
☆21Nov 25, 2016Updated 9 years ago
zjzser / WMCodec
View on GitHub
PyTorch Implementation of [WMCodec: End-to-End Neural Speech Codec with Deep Watermarking for Authenticity Verification](https://arxiv.or…
☆18Jul 31, 2025Updated 11 months ago
zjzser / TraceableSpeech
View on GitHub
TraceableSpeech: Towards Proactively Traceable Text-to-Speech with Watermarking
☆21Apr 18, 2025Updated last year
luan78zaoha / kaldi-timit-sre-ivector
View on GitHub
Develop speaker recognition model based on i-vector using TIMIT database
☆16Jul 4, 2019Updated 7 years ago
Kyubyong / specAugment
View on GitHub
Tensor2tensor experiment with SpecAugment
☆46May 13, 2019Updated 7 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
stdereka / liverpool-ion-switching
View on GitHub
Liverpool Ion Switching kaggle competition 2nd place winning solution
☆16Mar 25, 2023Updated 3 years ago
gp-b2g / frameworks_base
View on GitHub
frameworks_base for Geeksphone Peak and Keon
☆12Jan 13, 2015Updated 11 years ago
haoxiangsnr / A-Convolutional-Recurrent-Neural-Network-for-Real-Time-Speech-Enhancement
View on GitHub
A minimum unofficial implementation of the "A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement" (CRN) using PyTorc…
☆350Sep 5, 2020Updated 5 years ago
yzyouzhang / Awesome-Multimedia-Deepfake-Detection
View on GitHub
Materials for "Multimedia Deepfake Detection" Tutorial @ ICME 2024
☆17Aug 26, 2024Updated last year
cronrpc / Audio-Speaker-Needle-In-Haystack
View on GitHub
Finding the most similar tone/color in a large collection of audio. 在一大堆音频中寻找最相似的音色。
☆13Jun 17, 2024Updated 2 years ago
balcilar / Audio-Captcha-Recognition
View on GitHub
Recognition of Audio Captcha using SVM
☆25Mar 29, 2019Updated 7 years ago
joelthchao / CV-latex
View on GitHub
latex template for CV
☆18Jan 18, 2025Updated last year