Gammatone feature for robust speech recognition
☆14Aug 1, 2016Updated 9 years ago
Alternatives and similar repositories for Gfcc
Users that are interested in Gfcc are comparing it to the libraries listed below.
- CNN learns feature mapping between corrupted and clean speech☆12Aug 14, 2017Updated 8 years ago
- AAU VGIS9 semester project: emotion recognition based on facial features and voice☆14Jan 12, 2013Updated 13 years ago
- Transfer learning☆18Jul 21, 2018Updated 7 years ago
- some scripts for asvspoof2017☆11Dec 27, 2018Updated 7 years ago
- A neural network consisting of a CNN and an LSTM for speech enhancement☆25Aug 2, 2018Updated 7 years ago
- ☆12Sep 2, 2016Updated 9 years ago
- Some useful features for speech processing, such as MFCC, gammatone filterbank, GFCC, spectrum (power spectrum and log-power spectrum), Amplit…☆130Aug 12, 2020Updated 5 years ago
- This is part of code of a research on speech synthesizing for a low-resourced language: Gan, a Chinese dialect spoken primarily in Jiangx…☆17Sep 5, 2016Updated 9 years ago
- Bias Tests for Voice Technologies (bt4vt)☆11Jun 16, 2024Updated last year
- Collecting and sharing books used in everyday study☆18Dec 4, 2019Updated 6 years ago
- Whisper to Normal Speech Conversion with SC-MelGAN and SC-VQ-VAE☆15Dec 3, 2022Updated 3 years ago
- This repository contains the implementation of "Image Fusion Based on Deep Stacked Convolutional Neural Networks"☆10Oct 11, 2019Updated 6 years ago
- Feature extraction for accented-speech or pathological speech☆18Apr 2, 2019Updated 7 years ago
- Optimizing speaker verification and spoofing countermeasure systems together with REINFORCE☆13Mar 31, 2021Updated 5 years ago
- Guest lecture for Music 364, CCRMA, Stanford University, with Blair Kaneshiro.☆13Jan 28, 2017Updated 9 years ago
- A bunch of experiments using Bark and Mel scales, wavelets and paraconsistent feature engineering in order to find the best methods to cl…☆13Aug 16, 2023Updated 2 years ago
- Haskell to D3.js binding by deep EDSL approach.☆23Sep 20, 2014Updated 11 years ago
- Calculate MFCC/Fbank feature for wav files☆15Nov 21, 2017Updated 8 years ago
- Clone of the mp3gain sources from svn on sourceforge (http://mp3gain.sourceforge.net/)☆11Jan 3, 2013Updated 13 years ago
- CNN to recognize speaker on a spoken numbers dataset☆18Jan 23, 2017Updated 9 years ago
- Code for phonetically classifying TIMIT using TensorFlow☆18Jul 1, 2016Updated 9 years ago
- Official Implementation of VoxTracer (MM' 23)☆11Oct 27, 2023Updated 2 years ago
- Multi Layer Perceptron by Vivado HLS for Xilinx FPGA implementation☆12Dec 26, 2016Updated 9 years ago
- Illustrating EM for GMMs and HMMs☆12May 9, 2020Updated 5 years ago
- Robust Speech Recognition Using Generative Adversarial Networks (GAN)☆59Nov 25, 2019Updated 6 years ago
- Official PyTorch implementation of "t-EER: Parameter-Free Tandem Evaluation Metric of Countermeasures and Biometric Comparators"☆14Sep 25, 2023Updated 2 years ago
- ☆17Jul 21, 2017Updated 8 years ago
- "Automated Speech Recognition System" in Machine Learning and Having it Deep and Structured, Spring 2015☆21Nov 25, 2016Updated 9 years ago
- frameworks_base for Geeksphone Peak and Keon☆12Jan 13, 2015Updated 11 years ago
- PyTorch Implementation of [WMCodec: End-to-End Neural Speech Codec with Deep Watermarking for Authenticity Verification](https://arxiv.or…☆17Jul 31, 2025Updated 9 months ago
- Develop speaker recognition model based on i-vector using TIMIT database☆16Jul 4, 2019Updated 6 years ago
- Finding the most similar tone/color in a large collection of audio.☆13Jun 17, 2024Updated last year
- TraceableSpeech: Towards Proactively Traceable Text-to-Speech with Watermarking☆21Apr 18, 2025Updated last year
- A minimal unofficial implementation of the "A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement" (CRN) using PyTorc…☆347Sep 5, 2020Updated 5 years ago
- AWS virtual infrastructure simulator for training reinforcement learning based cloud capacity management systems☆11Sep 23, 2020Updated 5 years ago
- ☆24Apr 13, 2018Updated 8 years ago
- Augmenting Room Impulse Response☆43Sep 15, 2023Updated 2 years ago
- ☆18Jan 10, 2024Updated 2 years ago
- Filling and manipulation with histograms☆17Mar 10, 2025Updated last year