jameslyons/python_speech_features

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jameslyons/python_speech_features)

jameslyons / python_speech_features

This library provides common speech features for ASR including MFCCs and filterbank energies.

☆2,423

Alternatives and similar repositories for python_speech_features

Users that are interested in python_speech_features are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

mravanelli / pytorch-kaldi
View on GitHub
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch,…
☆2,401Mar 14, 2022Updated 4 years ago
kaldi-asr / kaldi
View on GitHub
kaldi-asr/kaldi is the official location of the Kaldi project.
☆15,439Sep 22, 2025Updated 10 months ago
pykaldi / pykaldi
View on GitHub
A Python wrapper for Kaldi
☆1,038Nov 30, 2025Updated 7 months ago
srvk / eesen
View on GitHub
The official repository of the Eesen project
☆834May 23, 2019Updated 7 years ago
tyiannak / pyAudioAnalysis
View on GitHub
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
☆6,254Aug 4, 2025Updated 11 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
astorfi / speechpy
View on GitHub
SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/latest/
☆883Dec 15, 2024Updated last year
espnet / espnet
View on GitHub
End-to-End Speech Processing Toolkit
☆9,903Updated this week
wiseman / py-webrtcvad
View on GitHub
Python interface to the WebRTC Voice Activity Detector
☆2,495Jul 4, 2024Updated 2 years ago
SeanNaren / deepspeech.pytorch
View on GitHub
Speech Recognition using DeepSpeech2.
☆2,136Dec 13, 2022Updated 3 years ago
mravanelli / SincNet
View on GitHub
SincNet is a neural architecture for efficiently processing raw audio samples.
☆1,241Apr 28, 2021Updated 5 years ago
zzw922cn / awesome-speech-recognition-speech-synthesis-papers
View on GitHub
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synth…
☆3,127Oct 19, 2023Updated 2 years ago
librosa / librosa
View on GitHub
Python library for audio and music analysis
☆8,521Updated this week
KarelVesely84 / kaldi-io-for-python
View on GitHub
Python functions for reading kaldi data formats. Useful for rapid prototyping with python.
☆378Jun 16, 2023Updated 3 years ago
crouchred / speaker-recognition-py3
View on GitHub
Base on MFCC and GMM(基于MFCC和高斯混合模型的语音识别)
☆254Mar 13, 2019Updated 7 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
jtkim-kaist / VAD
View on GitHub
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
☆869Jun 9, 2021Updated 5 years ago
pannous / tensorflow-speech-recognition
View on GitHub
🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks
☆2,173Jan 17, 2024Updated 2 years ago
Alexander-H-Liu / End-to-end-ASR-Pytorch
View on GitHub
This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with Pyt…
☆1,210Dec 19, 2020Updated 5 years ago
syhw / wer_are_we
View on GitHub
Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.
☆1,864Jun 27, 2022Updated 4 years ago
philipperemy / deep-speaker
View on GitHub
Deep Speaker: an End-to-End Neural Speaker Embedding System.
☆941Apr 13, 2024Updated 2 years ago
ppwwyyxx / speaker-recognition
View on GitHub
A Speaker Recognition System
☆677Apr 20, 2020Updated 6 years ago
hirofumi0810 / neural_sp
View on GitHub
End-to-end ASR/LM implementation with PyTorch
☆594Aug 30, 2021Updated 4 years ago
DemisEom / SpecAugment
View on GitHub
A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
☆655Apr 5, 2022Updated 4 years ago
zzw922cn / Automatic_Speech_Recognition
View on GitHub
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
☆2,834Mar 24, 2023Updated 3 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
santi-pdp / segan
View on GitHub
Speech Enhancement Generative Adversarial Network in TensorFlow
☆862Mar 24, 2023Updated 3 years ago
pytorch / audio
View on GitHub
Data manipulation and transformation for audio signal processing, powered by PyTorch
☆2,917Updated this week
kaituoxu / Speech-Transformer
View on GitHub
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
☆810Apr 6, 2023Updated 3 years ago
marsbroshok / VAD-python
View on GitHub
Voice Activity Detector in Python
☆481Nov 17, 2020Updated 5 years ago
r9y9 / pysptk
View on GitHub
A python wrapper for Speech Signal Processing Toolkit (SPTK).
☆451Jul 16, 2024Updated 2 years ago
manojpamk / pytorch_xvectors
View on GitHub
Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196
☆321Nov 11, 2020Updated 5 years ago
fgnt / nara_wpe
View on GitHub
Different implementations of "Weighted Prediction Error" for speech dereverberation
☆569Mar 19, 2025Updated last year
awni / speech
View on GitHub
A PyTorch Implementation of End-to-End Models for Speech-to-Text
☆768Jul 6, 2023Updated 3 years ago
nttcslab-sp / kaldiio
View on GitHub
A pure python module for reading and writing kaldi ark files
☆268Mar 6, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
k2-fsa / k2
View on GitHub
FSA/FST algorithms, differentiable, with PyTorch compatibility.
☆1,348Jul 11, 2026Updated 2 weeks ago
google / uis-rnn
View on GitHub
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Su…
☆1,588Sep 25, 2024Updated last year
s3prl / s3prl
View on GitHub
Self-Supervised Speech Pre-training and Representation Learning Toolkit
☆2,557Mar 12, 2026Updated 4 months ago
WeidiXie / VGG-Speaker-Recognition
View on GitHub
Utterance-level Aggregation For Speaker Recognition In The Wild
☆371Mar 24, 2023Updated 3 years ago
yongxuUSTC / sednn
View on GitHub
deep learning based speech enhancement using keras or pytorch, make it easy to use
☆339Feb 26, 2020Updated 6 years ago
wq2012 / awesome-diarization
View on GitHub
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
☆1,886Jul 7, 2026Updated 2 weeks ago
lhotse-speech / lhotse
View on GitHub
Tools for handling multimodal data in machine learning projects.
☆1,143Jun 22, 2026Updated last month