Computes the MFCC and GFCC parameters of speech, which can be used for speech signal feature extraction
☆10 · Jul 19, 2021 · Updated 4 years ago
Alternatives and similar repositories for Speech_MFCC_GFCC_Python
Users who are interested in Speech_MFCC_GFCC_Python are comparing it to the libraries listed below
- Automatically setup the AISHELL-4 and MSDWild dataset for usage with pyannote-database (and pyannote-audio) ☆15 · Oct 22, 2025 · Updated 4 months ago
- A pytorch-based implementation of Dirichlet Process Mixture Model (DPMM) ☆13 · Nov 27, 2025 · Updated 3 months ago
- Research code for the paper "Training speaker recognition systems with limited data" at https://arxiv.org/abs/2203.14688 ☆12 · Dec 2, 2024 · Updated last year
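- A complete chat UI framework with a well-developed database structure. It supports sending text, image, video, and voice messages, separates message types for high extensibility, and caches row heights to eliminate stutter while scrolling. The emoji keyboard is also handled thoroughly, for example rich-text input, display, and deletion, and the emoji images are complete and easy to replace. ☆16 · Jul 21, 2019 · Updated 6 years ago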
- [NeurIPS 2023] "Learning to Augment Distributions for Out-of-distribution Detection" ☆11 · Nov 14, 2023 · Updated 2 years ago
- kaldi based x-vector trained on Cn-Celeb ☆13 · Sep 22, 2020 · Updated 5 years ago
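- Integrates dataset preprocessing and interactive model loading for speaker recognition and speech separation (based on the TIMIT dataset) ☆17 · Apr 22, 2021 · Updated 4 years ago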
- Repo for wavelet based edge detection in Python ☆13 · Jan 4, 2016 · Updated 10 years ago
- ☆16 · Aug 30, 2023 · Updated 2 years ago
- PID tuning with a BP (backpropagation) neural network ☆10 · Apr 19, 2023 · Updated 2 years ago
- Autoencoder (AE) based methods for anomalous sound detection (ASD) ☆14 · Jan 10, 2023 · Updated 3 years ago
- MSR Identity Toolkit v1.0 ☆17 · Aug 18, 2017 · Updated 8 years ago
- introduction to iVectors with available speech data ☆11 · Mar 4, 2016 · Updated 10 years ago
- ToyADMOS2: Another dataset of miniature-machine operating sounds for anomalous sound detection under domain shift conditions 🚗 🚃 ☆21 · Apr 16, 2024 · Updated last year
- Develop speaker recognition model based on i-vector using TIMIT database ☆16 · Jul 4, 2019 · Updated 6 years ago
- Experiments on speech recognition robustness to accents and dialects ☆12 · Apr 2, 2019 · Updated 6 years ago
- This is the pytorch implementation of our work titled "An Efficient Temporary Deepfake Location Approach Based Embeddings for Partially S… ☆22 · Nov 2, 2024 · Updated last year
- I-Vector Speaker recognition system implemented with MSRIT in matlab ☆15 · Jan 12, 2016 · Updated 10 years ago
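- PySVM: A NumPy implementation of SVM based on the SMO algorithm. Builds SVM classification, regression, and one-class models with NumPy, with support for a caching mechanism and random Fourier features ☆27 · Nov 19, 2023 · Updated 2 years ago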
- An Adaptive Multi-Channel Attention Method for Fault Diagnosis ☆19 · Dec 27, 2023 · Updated 2 years ago
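- Python implementations of speaker recognition (voiceprint recognition) algorithms, including GMM (done), GMM-UBM, ivector, and deep-learning-based speaker recognition (self-attention done). ☆108 · Feb 21, 2023 · Updated 3 years ago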
- ☆19 · Jun 25, 2012 · Updated 13 years ago
- [IEEE IoTJ 2024] Heterogeneous Federated Learning: Client-side Collaborative Update Inter-Domain Generalization Method for Intelligent Fau… ☆14 · Jun 23, 2025 · Updated 8 months ago
- Evaluation Metrics Used For The Performance Evaluation of Voice Conversion (VC) Models ☆19 · Jul 8, 2025 · Updated 8 months ago
- ☆69 · Jul 17, 2024 · Updated last year
- ☆49 · Jun 16, 2025 · Updated 9 months ago
- ☆26 · Jun 5, 2024 · Updated last year
- Lightweight Speech Representation Learning for One-Shot Voice Conversion ☆24 · Dec 12, 2024 · Updated last year
- Rethinking Decoder Design: Improving Biomarker Segmentation Using Depth-to-Space Restoration and Residual Linear Attention - Accepted in … ☆48 · Jun 7, 2025 · Updated 9 months ago
- Alternative version of st.camera_input which returns the webcam images live, without any button press needed ☆39 · Aug 4, 2025 · Updated 7 months ago
- Autoencoder-based baseline system for DCASE2021 Challenge Task 2. ☆27 · Jun 9, 2021 · Updated 4 years ago
- ☆13 · Sep 25, 2024 · Updated last year
- Speaker recognition from speech in MATLAB ☆22 · Jul 22, 2017 · Updated 8 years ago
- Source code for "FedSoft: Soft Clustered Federated Learning with Proximal Local Updating" ☆20 · Apr 28, 2022 · Updated 3 years ago
- ☆33 · Jan 14, 2023 · Updated 3 years ago
- A simple middleware for improving GPU utilization and speeding up online inference. ☆19 · Feb 22, 2021 · Updated 5 years ago
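- Speech recognition API offering real-time speech recognition and offline recognition of uploaded long audio, with real-time transcription and simultaneous interpretation for Chinese, English, and the languages of up to 100 countries ☆81 · Dec 30, 2024 · Updated last year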
- The project for speech translation ☆12 · Sep 28, 2023 · Updated 2 years ago
- ☆24 · Dec 20, 2022 · Updated 3 years ago