kingback2019/Speech_MFCC_GFCC_Python

Computes the MFCC and GFCC parameters of speech; can be used for speech signal feature extraction.

☆10

Alternatives and similar repositories for Speech_MFCC_GFCC_Python

Users who are interested in Speech_MFCC_GFCC_Python are comparing it to the libraries listed below.

zhaoyi2 / xvector-cnceleb
kaldi based x-vector trained on Cn-Celeb
☆13 · Sep 22, 2020 · Updated 5 years ago
danielecastellana22 / torch-dpmm
A pytorch-based implementation of Dirichlet Process Mixture Model (DPMM)
☆13 · Apr 20, 2026 · Updated 3 months ago
tmlr-group / DAL
[NeurIPS 2023] "Learning to Augment Distributions for Out-of-distribution Detection"
☆11 · Nov 14, 2023 · Updated 2 years ago
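xiaoyi3699 / LLChat
A complete chat UI framework with a fully worked-out database structure. It supports sending text, image, video, and voice messages, separates message types for high extensibility, and caches row heights to fix stuttering while scrolling. The emoji keyboard is also handled thoroughly, for example rich-text input, display, and deletion, and the emoji images are highly complete and replaceable.
☆16 · Jul 21, 2019 · Updated 7 years ago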
aravindr18 / Wavelet-Based-Edge-Detection
Repo for wavelet based edge detection in Python
☆12 · Jan 4, 2016 · Updated 10 years ago
yujiacheng333 / Speech-Experiment
Integrates dataset preprocessing and model loading/interaction for speaker recognition and speech separation (based on the TIMIT dataset)
☆17 · Apr 22, 2021 · Updated 5 years ago
nikvaessen / w2v2-speaker-few-samples
Research code for the paper "Training speaker recognition systems with limited data" at https://arxiv.org/abs/2203.14688
☆13 · Dec 2, 2024 · Updated last year
wangwei2009 / MSR-Identity-Toolkit-v1.0
MSR Identity Toolkit v1.0
☆16 · Aug 18, 2017 · Updated 8 years ago
liuyoude / AE-ASD
Autoencoder (AE) based methods for anomalous sound detection (ASD)
☆13 · Jan 10, 2023 · Updated 3 years ago
yaoweibinoo / BP_PID
PID tuning with a BP (backpropagation) neural network
☆11 · Apr 19, 2023 · Updated 3 years ago
izlandman / iVector
introduction to iVectors with available speech data
☆11 · Mar 4, 2016 · Updated 10 years ago
nttcslab / ToyADMOS2-dataset
ToyADMOS2: Another dataset of miniature-machine operating sounds for anomalous sound detection under domain shift conditions 🚗 🚃
☆21 · Apr 16, 2024 · Updated 2 years ago
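WeltXing / PySVM
PySVM: A NumPy implementation of SVM based on the SMO algorithm. SVM classification, regression, and one-class classification built with NumPy, with support for a caching mechanism and random Fourier features.
☆27 · Nov 19, 2023 · Updated 2 years ago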
luan78zaoha / kaldi-timit-sre-ivector
Develop speaker recognition model based on i-vector using TIMIT database
☆16 · Jul 4, 2019 · Updated 7 years ago
xieyuankun / TDL-ADD
This is the pytorch implementation of our work titled "An Efficient Temporary Deepfake Location Approach Based Embeddings for Partially S…
☆22 · Nov 2, 2024 · Updated last year
pedrocolon93 / ivectormatlabmsrit
I-Vector Speaker recognition system implemented with MSRIT in matlab
☆15 · Jan 12, 2016 · Updated 10 years ago
SilvrDuck / AccentedSpeechRecognition
Experiments on speech recognition robustness to accents and dialects
☆12 · Apr 2, 2019 · Updated 7 years ago
FrenchKrab / datasets-pyannote
Automatically setup the AISHELL-4 and MSDWild dataset for usage with pyannote-database (and pyannote-audio)
☆15 · Oct 22, 2025 · Updated 8 months ago
tokheim / iVector
☆19 · Jun 25, 2012 · Updated 14 years ago
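Kevinnan-teen / Speaker-Recognition
Python implementations of speaker recognition (voiceprint recognition) algorithms, including GMM (completed), GMM-UBM, ivector, and deep-learning-based voiceprint recognition (self-attention completed).
☆108 · Feb 21, 2023 · Updated 3 years ago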
danny95928 / pca-ResNet-fault-diagnosis
An Adaptive Multi-Channel Attention Method for Fault Diagnosis
☆22 · Dec 27, 2023 · Updated 2 years ago
What-a-mess / Wind-Turbine-SCADA-Anomaly-Detection
☆18 · Aug 30, 2023 · Updated 2 years ago
JC952 / P2PCHF
[IEEE IoTJ 2024] Heterogeneous Federated Learning: Client-side Collaborative Update Inter-Domain Generalization Method for Intelligent Fau…
☆14 · Jun 23, 2025 · Updated last year
SandyPanda-MLDL / -Evaluation-Metrics-Used-For-The-Performance-Evaluation-of-Voice-Conversion-VC-Models
Evaluation Metrics Used For The Performance Evaluation of Voice Conversion (VC) Models
☆19 · Jul 8, 2025 · Updated last year
lovemefan / telespeech-asr-python
☆68 · Jul 17, 2024 · Updated 2 years ago
tianyuan168326 / All-in-One-MedReID-Pytorch
Official Code for All-in-One Medical Image Re-Identification (CVPR2025)
☆20 · Jan 11, 2026 · Updated 6 months ago
3140102441 / speak-recognization
Speaker speech recognition in MATLAB
☆22 · Jul 22, 2017 · Updated 8 years ago
PecholaL / MAIN-VC
Lightweight Speech Representation Learning for One-Shot Voice Conversion
☆23 · Dec 12, 2024 · Updated last year
Dapwner / CVAE-Tacotron
☆26 · Jun 5, 2024 · Updated 2 years ago
blackary / streamlit-camera-input-live
Alternative version of st.camera_input which returns the webcam images live, without any button press needed
☆39 · Aug 4, 2025 · Updated 11 months ago
y-kawagu / dcase2021_task2_baseline_ae
Autoencoder-based baseline system for DCASE2021 Challenge Task 2.
☆27 · Jun 9, 2021 · Updated 5 years ago
tuanio / nextformer
PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"
☆10 · Dec 15, 2022 · Updated 3 years ago
nervjack2 / Speech2Unit
☆13 · Sep 25, 2024 · Updated last year
genzen2103 / Speaker-Recognition-System-using-GMM
System for identifying speaker from given speech signal using MFCC, LPC features and Gaussian Mixture Models
☆21 · Nov 5, 2017 · Updated 8 years ago
ycruan / FedSoft
Source code for "FedSoft: Soft Clustered Federated Learning with Proximal Local Updating"
☆20 · Apr 28, 2022 · Updated 4 years ago
insunhwang89 / StyleVC
☆33 · Jan 14, 2023 · Updated 3 years ago
xuchennlp / S2T
The project for speech translation
☆12 · Sep 28, 2023 · Updated 2 years ago
markwwen / ServingAgent
A simple middleware to improve GPU utilization and speed up online inference.
☆19 · Feb 22, 2021 · Updated 5 years ago
zilogo / tradingagents-cn
A Chinese-language financial trading framework based on multi-agent LLMs - an enhanced Chinese edition of TradingAgents
☆21 · Dec 21, 2025 · Updated 7 months ago