Speech recognition, MFCC feature processing, CNN neural network
☆105Jan 22, 2019Updated 7 years ago
Alternatives and similar repositories for phonetic-recognition
Users that are interested in phonetic-recognition are comparing it to the libraries listed below.
- Face recognition, eye feature detection, liveness detection, face rotation and profile-face frontalization☆92Jan 22, 2019Updated 7 years ago
- Speech recognition based on MFCC and GMM (Gaussian mixture models)☆255Mar 13, 2019Updated 7 years ago
- Classification of Urban sounds using several classification methods, namely SVM, MLP and CNN using MFCC features.☆13Apr 15, 2020Updated 6 years ago
- Speech recognition using Python☆170Feb 16, 2022Updated 4 years ago
- VAD source code extracted from WebRTC, for use in speech analysis and detection☆30Oct 31, 2017Updated 8 years ago
- A speech recognition system based on a fully convolutional neural network☆79Jun 28, 2019Updated 6 years ago
- Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)☆221Jul 6, 2023Updated 2 years ago
- Chinese speech recognition using CTC with Keras☆44Jul 10, 2018Updated 7 years ago
- Ginkgo Yellow project: speech emotion recognition☆13Nov 6, 2016Updated 9 years ago
- Official codebase for "Context Aware Deep Learning for Multi Modal Depression Detection" [ICASSP 2019, Oral]☆11Dec 26, 2024Updated last year
- Moving object detection algorithm based on YOLOv3 and Brox optical flow, with motion compensation for dynamic backgrounds☆15Jul 17, 2019Updated 6 years ago
- Chinese automatic speech recognition (ASR)☆14Dec 30, 2021Updated 4 years ago
- Speaker Recognition System using MFCC and GMM.☆24Apr 8, 2018Updated 8 years ago
- Web app created to collect audio recordings for a course project☆10Apr 6, 2018Updated 8 years ago
- Bank card recognition based on Python 3.6, OpenCV 3, TensorFlow, and a CNN☆13Dec 9, 2022Updated 3 years ago
- Speech endpoint detection system based on double-threshold detection☆24Jan 16, 2018Updated 8 years ago
- ☆17Apr 26, 2019Updated 6 years ago
- Speech recognition implemented with Python and TensorFlow☆48Oct 30, 2018Updated 7 years ago
- MFCC-based speech feature extraction and recognition☆75Jul 12, 2015Updated 10 years ago
- API to provide animation details for sample audio☆13Feb 28, 2018Updated 8 years ago
- Speaker recognition and verification with deep learning☆13Mar 7, 2017Updated 9 years ago
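- Python implementation of speaker recognition (voiceprint recognition) algorithms, including GMM (done), GMM-UBM, i-vector, and deep-learning-based speaker recognition (self-attention done).☆107Feb 21, 2023Updated 3 years ago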
- ☆12Sep 2, 2016Updated 9 years ago
- MMM 2021: Crossed-Time Delay Neural Network for Speaker Recognition☆11Dec 4, 2021Updated 4 years ago
- Mandarin Automatic Speech Recognition☆1,968Jul 25, 2024Updated last year
- Detecting depressed patients based on speech activity and pauses in speech, using a deep learning approach☆20Jan 5, 2023Updated 3 years ago
- Google Speech Command Dataset Classification Neural Network, CNN, RNN☆26Aug 29, 2017Updated 8 years ago
- CNN learns feature mapping between corrupted and clean speech☆12Aug 14, 2017Updated 8 years ago
- CASME II: An Improved Spontaneous Micro-Expression Database and the Baseline Evaluation☆10Oct 19, 2018Updated 7 years ago
- This repository applies Deep Learning techniques for depression detection in text, using LSTM, GRU, BiLSTM, BERT models, and a baseline F…☆19Jul 14, 2023Updated 2 years ago
- ☆11Apr 8, 2026Updated last week
- This library provides common speech features for ASR including MFCCs and filterbank energies.☆2,422Oct 20, 2021Updated 4 years ago
- This is part of code of a research on speech synthesizing for a low-resourced language: Gan, a Chinese dialect spoken primarily in Jiangx…☆17Sep 5, 2016Updated 9 years ago
- Materials related to information retrieval and data mining☆16May 13, 2019Updated 6 years ago
- PyTorch Implementation of Time/Frequency Masks☆12May 22, 2019Updated 6 years ago
- Simple acoustic feature extraction using the Librosa audio processing library and the openSMILE toolkit.☆217May 26, 2020Updated 5 years ago
- Bilibili API definitions conforming to the OpenAPI 3.0 specification.☆18Jan 18, 2026Updated 2 months ago
- Leverage 3D video and Spatial Audio to deliver an immersive experience.☆11Oct 11, 2023Updated 2 years ago
- Windows 10 intelligent customer-service Q&A system based on text similarity☆16Mar 12, 2020Updated 6 years ago