mfcc, mel, pcen. (librosa)
☆36Nov 20, 2019Updated 6 years ago
Alternatives and similar repositories for MFCC
Users that are interested in MFCC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LibrosaCpp is a c++ implemention of librosa to compute short-time fourier transform coefficients,mel spectrogram or mfcc☆239Dec 28, 2020Updated 5 years ago
- C code to extract mfcc or fbank features from wav files☆17Oct 25, 2019Updated 6 years ago
- rewrite python scipy.signal.lfilter in c code☆11Aug 13, 2019Updated 6 years ago
- C/C++实现Python音频处理库librosa中melspectrogram的计算过程☆31Jan 14, 2022Updated 4 years ago
- implementing beamforming algorithm in C++☆11Jan 9, 2020Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A C++ implementation of stft, melspectrogram and mel_to_stft☆11Jun 2, 2022Updated 4 years ago
- autoTVM神经网络推理代码优化搜索演示,基于tvm编译开源模型centerface,并使用autoTVM搜索最优推理代码, 最终部署编译为c++代码,演示平台是cuda,可以是其他平台,例如树莓派,安卓手机,苹果手机.Thi is a demonstration of …☆31May 6, 2021Updated 5 years ago
- ☆23Apr 6, 2016Updated 10 years ago
- 单独移植编译webrtc的aec模块☆22Aug 30, 2018Updated 7 years ago
- Optimized Syncnet and Chinese enhanced version, EN and CN checkpoints released☆11Nov 8, 2021Updated 4 years ago
- ☆24Mar 18, 2024Updated 2 years ago
- Create MP4 videos from JPG/PNG/GIF/BMP images☆14Feb 21, 2015Updated 11 years ago
- dlib implementation of Siamese Network Training with Caffe☆11Mar 7, 2018Updated 8 years ago
- 音频特征提取程序,MFCC,HFCC,MFCC_WALSH,Philips☆31Mar 31, 2019Updated 7 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A library for adding punctuation into a text from ASR.☆19May 8, 2023Updated 3 years ago
- CTC+Beam_Search+kenlm 是用于以汉字为声学模型建模单元的解码系统☆48Jun 27, 2018Updated 8 years ago
- ppstructure deploy by ncnn☆36Jul 16, 2024Updated last year
- Automatic system for audio denoising by wavelet transform.☆13Jun 13, 2016Updated 10 years ago
- ☆29Jul 9, 2022Updated 3 years ago
- Yolact running on the ncnn framework on a bare Raspberry Pi 4 with 64 OS, overclocked to 1950 MHz☆12Dec 29, 2022Updated 3 years ago
- Simple ALSA app for looping audio from capture to playback☆21Mar 8, 2013Updated 13 years ago
- Streaming Audiotransformers for online Audio tagging☆57Jun 14, 2024Updated 2 years ago
- Python toolbox for decorrelating and upmixing audio signals.☆37Sep 11, 2019Updated 6 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A Deeplearn Model to rec table in photo with ncnn. 一个深度学习模型用于检测图片中的表格 画像内のテーブルを検出するためのディープラーニング モデル☆20Mar 2, 2025Updated last year
- 用rtaudio来采集、播放,并用speexdsp来做回声消除。☆19Jun 30, 2018Updated 8 years ago
- Talking Face Generation system☆19Oct 16, 2023Updated 2 years ago
- ☆16Apr 4, 2022Updated 4 years ago
- silero-vad pytorch implement☆36Nov 23, 2024Updated last year
- An unofficial code reproduction of Channel Attention Dense U-Net for Multichannel Speech Enhancement☆13Jul 17, 2023Updated 2 years ago
- ☆33Nov 27, 2021Updated 4 years ago
- [Tiny KWS] SparkNet: Sparse Binarization for Fast Keyword Spotting☆18Aug 26, 2025Updated 10 months ago
- Voice activity detection (VAD) library and Go bindings based on WebRTC's VAD engine☆11Mar 1, 2018Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Tools to convert sigsep mus dataset from STEMS <-> WAV☆12Jul 15, 2020Updated 5 years ago
- The state-of-art time domain network for speech separation, and it performs well on speech enhancement and music separation☆48Feb 14, 2019Updated 7 years ago
- Official Repository for "Learning to Visually Localize Sound Sources from Mixtures without Prior Source Knowledge" (CVPR 2024)☆16Sep 1, 2024Updated last year
- A simple MFCC extractor using C++ STL and C++11☆126Dec 4, 2019Updated 6 years ago
- SummerAsr 是一个基于C++的可独立编译且几乎没有额外依赖库的本地中文语音识别器。 Summer Asr is a Chinese automatic speech recognize project written with C++ that can be eas…☆101Dec 14, 2024Updated last year
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a…☆44Nov 10, 2021Updated 4 years ago
- Implementation of our paper "Exploiting Unsupervised Data for Emotion Recognition in Conversations" in the Findings of EMNLP-2020.☆13Nov 17, 2020Updated 5 years ago