Uberi/speech_recognition

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Uberi/speech_recognition)

Uberi / speech_recognition

Speech recognition module for Python, supporting several engines and APIs, online and offline.

☆8,982

Alternatives and similar repositories for speech_recognition

Users that are interested in speech_recognition are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

mozilla / DeepSpeech
View on GitHub
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Ras…
☆26,770Jun 19, 2025Updated last year
pannous / tensorflow-speech-recognition
View on GitHub
🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks
☆2,173Jan 17, 2024Updated 2 years ago
bambocher / pocketsphinx-python
View on GitHub
Python interface to CMU Sphinxbase and Pocketsphinx libraries
☆373Jun 27, 2023Updated 3 years ago
kaldi-asr / kaldi
View on GitHub
kaldi-asr/kaldi is the official location of the Kaldi project.
☆15,447Sep 22, 2025Updated 10 months ago
cmusphinx / pocketsphinx
View on GitHub
A small speech recognizer
☆4,327Updated this week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
zzw922cn / Automatic_Speech_Recognition
View on GitHub
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
☆2,832Mar 24, 2023Updated 3 years ago
jiaaro / pydub
View on GitHub
Manipulate audio with a simple and easy high level interface
☆9,778Mar 19, 2026Updated 4 months ago
buriburisuri / speech-to-text-wavenet
View on GitHub
Speech-to-Text-WaveNet : End-to-end sentence level English speech recognition based on DeepMind's WaveNet and tensorflow
☆4,005Oct 8, 2021Updated 4 years ago
tyiannak / pyAudioAnalysis
View on GitHub
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
☆6,255Aug 4, 2025Updated 11 months ago
nateshmbhat / pyttsx3
View on GitHub
Offline Text To Speech synthesis for python
☆2,524Jul 22, 2026Updated last week
nl8590687 / ASRT_SpeechRecognition
View on GitHub
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
☆8,381Apr 10, 2026Updated 3 months ago
librosa / librosa
View on GitHub
Python library for audio and music analysis
☆8,533Updated this week
pndurette / gTTS
View on GitHub
Python library and CLI tool to interface with Google Translate's text-to-speech API
☆2,624Apr 6, 2026Updated 3 months ago
espnet / espnet
View on GitHub
End-to-End Speech Processing Toolkit
☆9,906Updated this week
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
flashlight / wav2letter
View on GitHub
Facebook AI Research's Automatic Speech Recognition Toolkit
☆6,439Jul 14, 2026Updated 2 weeks ago
alphacep / vosk-api
View on GitHub
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
☆15,021Jul 2, 2026Updated last month
explosion / spaCy
View on GitHub
💫 Industrial-strength Natural Language Processing (NLP) in Python
☆33,796May 19, 2026Updated 2 months ago
cmusphinx / pocketsphinx-python
View on GitHub
Python module installed with setup.py
☆337Jun 29, 2022Updated 4 years ago
openai / whisper
View on GitHub
Robust Speech Recognition via Large-Scale Weak Supervision
☆106,447Updated this week
speechbrain / speechbrain
View on GitHub
A PyTorch-based Speech Toolkit
☆11,730Jun 15, 2026Updated last month
ageitgey / face_recognition
View on GitHub
The world's simplest facial recognition api for Python and the command line
☆56,655Jun 25, 2026Updated last month
keras-team / keras
View on GitHub
Deep Learning for humans
☆64,217Updated this week
tensorflow / models
View on GitHub
Models and examples built with TensorFlow
☆77,666Jul 23, 2026Updated last week
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
SeanNaren / deepspeech.pytorch
View on GitHub
Speech Recognition using DeepSpeech2.
☆2,136Dec 13, 2022Updated 3 years ago
Kitt-AI / snowboy
View on GitHub
Future versions with model training module will be maintained through a forked version here: https://github.com/seasalt-ai/snowboy
☆3,364Oct 13, 2021Updated 4 years ago
gunthercox / ChatterBot
View on GitHub
ChatterBot is a machine learning, conversational dialog engine for creating chat bots
☆14,504Jun 19, 2026Updated last month
jameslyons / python_speech_features
View on GitHub
This library provides common speech features for ASR including MFCCs and filterbank energies.
☆2,423Oct 20, 2021Updated 4 years ago
alumae / kaldi-gstreamer-server
View on GitHub
Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.
☆1,094Jun 8, 2024Updated 2 years ago
cmusatyalab / openface
View on GitHub
Face recognition with deep neural networks.
☆15,431Jul 18, 2026Updated 2 weeks ago
pyannote / pyannote-audio
View on GitHub
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker…
☆10,371Jul 24, 2026Updated last week
wiseman / py-webrtcvad
View on GitHub
Python interface to the WebRTC Voice Activity Detector
☆2,495Jul 4, 2024Updated 2 years ago
coqui-ai / TTS
View on GitHub
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
☆45,851Aug 16, 2024Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
RasaHQ / rasa
View on GitHub
💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, …
☆21,274Jul 24, 2026Updated last week
piskvorky / gensim
View on GitHub
Topic Modelling for Humans
☆16,476Nov 1, 2025Updated 9 months ago
facebookresearch / fastText
View on GitHub
Library for fast text representation and classification.
☆26,547Mar 22, 2024Updated 2 years ago
tensorflow / tensorflow
View on GitHub
An Open Source Machine Learning Framework for Everyone
☆196,664Updated this week
Zulko / moviepy
View on GitHub
Video editing with Python
☆14,824Mar 7, 2026Updated 4 months ago
zzw922cn / awesome-speech-recognition-speech-synthesis-papers
View on GitHub
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synth…
☆3,126Oct 19, 2023Updated 2 years ago
sloria / TextBlob
View on GitHub
Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.
☆9,546Updated this week