C code to extract mfcc or fbank features from wav files
☆17Oct 25, 2019Updated 6 years ago
Alternatives and similar repositories for MFCC
Users that are interested in MFCC are comparing it to the libraries listed below
Sorting:
- The official repository of the Eesen project☆12Jun 20, 2018Updated 7 years ago
- mfcc, mel, pcen. (librosa)☆36Nov 20, 2019Updated 6 years ago
- Super spectrogram Qt cross platform!☆13Apr 4, 2017Updated 8 years ago
- dlib implementation of Siamese Network Training with Caffe☆11Mar 7, 2018Updated 8 years ago
- ☆13May 18, 2022Updated 3 years ago
- Audio WAV file tools for C# read and write, 8 and 16 bits, mono and stereo.☆11Oct 20, 2015Updated 10 years ago
- 从Kaldi中裁剪的轻量级语音识别解码推理框架,目前实现了MFCC+GMM+Viterbi,不依赖OpenFST、OpenBLAS等库☆22Jul 31, 2021Updated 4 years ago
- ☆16Aug 10, 2025Updated 7 months ago
- Simple real-time Sound Event Detector based on YAMNet and pyaudio.☆23Jan 16, 2020Updated 6 years ago
- ☆11May 4, 2020Updated 5 years ago
- ☆56Jul 17, 2023Updated 2 years ago
- Automatically exported from code.google.com/p/libmfcc☆27Mar 14, 2015Updated 11 years ago
- ☆40Aug 15, 2021Updated 4 years ago
- Computer vision framework based on deep learning and GPU programming☆17Jun 16, 2019Updated 6 years ago
- target speaker verification (tSV), ts-vector, universal speaker verification for single- and multi-talker speech☆15Jan 26, 2021Updated 5 years ago
- ☆13May 9, 2022Updated 3 years ago
- Dynamically parse and fill different formats of wav headers.☆11Jan 11, 2024Updated 2 years ago
- ☆13Sep 25, 2024Updated last year
- A simple MFCC extractor using C++ STL and C++11☆126Dec 4, 2019Updated 6 years ago
- CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence ar…☆85Jun 17, 2025Updated 9 months ago
- C library for speech pre-processing.☆12Jun 7, 2019Updated 6 years ago
- 基于Qt的一款截图工具☆11Nov 8, 2016Updated 9 years ago
- ☆29Aug 4, 2018Updated 7 years ago
- The project for speech translation☆12Sep 28, 2023Updated 2 years ago
- ☆17Aug 19, 2025Updated 7 months ago
- ☆10May 24, 2019Updated 6 years ago
- An extension of thu-spmi/CAT which contains a full-fledged implementation of CTC-CRF for Tensorflow.☆12Jul 5, 2021Updated 4 years ago
- The official repo/implementation of the paper "Training a Singing Transcription Model Using Connectionist Temporal Classification Loss an…☆12Mar 25, 2025Updated 11 months ago
- creates a wav file from multiple bin (redump.org format)☆12Apr 19, 2018Updated 7 years ago
- Code for synchronising all CHiME-5 audio signals for use in CHiME-6☆18Dec 2, 2019Updated 6 years ago
- APAM toolkit is built on PyTorch and provides recipes to adapt pretrained acoustic models with a variety of sequence discriminative train…☆14Feb 15, 2021Updated 5 years ago
- Minerva是一个便捷的音频工具,支持快速进行录音(PCM/MP3/WAV)和VAD端点检测识别,并保存活动语音。☆10May 23, 2024Updated last year
- A lightweight JSON parser and serializer for Qt5 and Qt6☆10Sep 15, 2025Updated 6 months ago
- 从MinerU中提取出来的文本检测识别部分,通过pytorch实现paddleocr的文本检测识别☆17Jun 2, 2025Updated 9 months ago
- ☆15Mar 12, 2024Updated 2 years ago
- Waveform sound playground application for Windows, written in VC++. Generating waveform from specified frequency. FFT analyzer up to 65 k…☆11Mar 11, 2026Updated last week
- PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"☆10Dec 15, 2022Updated 3 years ago
- A quick PSNR/SSIM analyzer for Linux☆10Mar 22, 2015Updated 11 years ago
- Experiment with JNI access to some Kaldi functions.☆12Dec 31, 2018Updated 7 years ago