A simple example for use speech recognition baidu api with python.
☆115Apr 8, 2021Updated 5 years ago
Alternatives and similar repositories for python-Speech_Recognition
Users that are interested in python-Speech_Recognition are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- audio cfeatures extraction tool from wav to h5features format☆19May 24, 2019Updated 7 years ago
- solutions for https://www.kaggle.com/c/tensorflow-speech-recognition-challenge☆31Jan 28, 2018Updated 8 years ago
- ☆13Jun 26, 2015Updated 10 years ago
- A generative method to synthesize free-hand styled sketches from images☆22Jun 19, 2016Updated 9 years ago
- 树莓派语音识别机器人(项目转移到autohome项目)☆230Apr 17, 2017Updated 9 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- 语智科技远场(单麦克风)语音识别引擎 FFASR 接入指南☆15Aug 4, 2023Updated 2 years ago
- The wizard of oz code used for collecting goal-oriented dialogue systems☆13Oct 30, 2017Updated 8 years ago
- speaker recognition using keras☆36Nov 29, 2022Updated 3 years ago
- Paderwasn is a collection of methods for acoustic signal processing in wireless acoustic sensor networks (WASNs).☆18May 8, 2025Updated last year
- A step by step guide on how to use tensorflow serving to serve a tensorflow model.☆26Oct 12, 2022Updated 3 years ago
- 未来杯语音赛道说话人识别的baseline☆49Apr 9, 2019Updated 7 years ago
- Base on MFCC and GMM(基于MFCC和高斯混合模型的语音识别)☆255Mar 13, 2019Updated 7 years ago
- Sketch Based Image Retrieval☆10Jul 13, 2018Updated 7 years ago
- 【中文语音识别 】【验证码识别】☆120Jun 17, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 根据MFCC提取音频特征,训练“飞鱼秀”音频节目语音和音乐的切割。☆30Dec 28, 2017Updated 8 years ago
- Feedforward Sequential Memory Networks (FSMN) implemented by tensorflow☆52Dec 11, 2016Updated 9 years ago
- speech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech an…☆15Dec 19, 2018Updated 7 years ago
- ☆13Oct 10, 2017Updated 8 years ago
- Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.☆870Jun 9, 2021Updated 4 years ago
- Python implementation of A La Carte Embedding☆10Dec 7, 2018Updated 7 years ago
- A journey to explore neural style algorithm: From GPU to Mobile!☆50Oct 10, 2016Updated 9 years ago
- ☆13May 8, 2015Updated 11 years ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆61Oct 7, 2020Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Wake-Up-Word Keyword Spotting implemented in Keras☆35Oct 1, 2017Updated 8 years ago
- WIP Tensorflow implementation of https://github.com/mozilla/TTS☆15Apr 11, 2020Updated 6 years ago
- An end-to-end system that performs temporal recognition of gesture sequences using speech and skeletal input. The model combines three ne…☆29May 1, 2019Updated 7 years ago
- Fine-tune Inception v3 for muli-label classification on HICO dataset in TensorFlow☆24Oct 4, 2017Updated 8 years ago
- Segmentation algorithm for MIREX 2014☆14Dec 16, 2015Updated 10 years ago
- Software for Decoding of High Order Ambisonics to Irregular Layouts☆13Mar 20, 2014Updated 12 years ago
- A Deep Convolutional Neural Network (DCNN) designed for the task of localizing human speech to 168 location classes using binaural microp…☆10Dec 16, 2017Updated 8 years ago
- Convert torch model to/from caffe model easily☆12Nov 29, 2016Updated 9 years ago
- Measuring room impulse responses with python and sounddevice☆85Jun 30, 2019Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 基于OpenCV深度学习与face++共同识别人脸项目☆29Oct 16, 2017Updated 8 years ago
- ☆12Oct 25, 2020Updated 5 years ago
- Ossian: A simple language-independent Text-to-speech frontend☆17Mar 1, 2018Updated 8 years ago
- A collection of minimal examples for the sparta plug-ins.☆13Jul 12, 2025Updated 10 months ago
- Multiobjective Optimization Training of PLDA for Speaker Verification☆10Jun 14, 2018Updated 7 years ago
- ☆10Apr 7, 2022Updated 4 years ago
- ☆13Aug 13, 2023Updated 2 years ago