Speech recognition module for Python, supporting several engines and APIs, online and offline.
☆8,953Jan 2, 2026Updated last month
Alternatives and similar repositories for speech_recognition
Users that are interested in speech_recognition are comparing it to the libraries listed below
Sorting:
- DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Ras…☆26,727Jun 19, 2025Updated 8 months ago
- 🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks☆2,176Jan 17, 2024Updated 2 years ago
- kaldi-asr/kaldi is the official location of the Kaldi project.☆15,331Sep 22, 2025Updated 5 months ago
- Python interface to CMU Sphinxbase and Pocketsphinx libraries☆373Jun 27, 2023Updated 2 years ago
- A small speech recognizer☆4,277Updated this week
- Manipulate audio with a simple and easy high level interface☆9,740Jul 26, 2025Updated 7 months ago
- End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow☆2,840Mar 24, 2023Updated 2 years ago
- Speech-to-Text-WaveNet : End-to-end sentence level English speech recognition based on DeepMind's WaveNet and tensorflow☆4,009Oct 8, 2021Updated 4 years ago
- Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications☆6,221Aug 4, 2025Updated 6 months ago
- 💫 Industrial-strength Natural Language Processing (NLP) in Python☆33,254Nov 27, 2025Updated 3 months ago
- End-to-End Speech Processing Toolkit☆9,747Updated this week
- Python library for audio and music analysis☆8,227Feb 20, 2026Updated last week
- Facebook AI Research's Automatic Speech Recognition Toolkit☆6,446Jan 12, 2026Updated last month
- Offline Text To Speech synthesis for python☆2,486Dec 16, 2025Updated 2 months ago
- Robust Speech Recognition via Large-Scale Weak Supervision☆95,206Dec 15, 2025Updated 2 months ago
- Deep Learning for humans☆63,866Updated this week
- The world's simplest facial recognition api for Python and the command line☆56,145Aug 21, 2024Updated last year
- A PyTorch-based Speech Toolkit☆11,243Feb 11, 2026Updated 2 weeks ago
- Models and examples built with TensorFlow☆77,693Updated this week
- Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node☆14,301Feb 22, 2026Updated last week
- Python library and CLI tool to interface with Google Translate's text-to-speech API☆2,586Dec 15, 2025Updated 2 months ago
- ChatterBot is a machine learning, conversational dialog engine for creating chat bots☆14,477Feb 17, 2026Updated last week
- Face recognition with deep neural networks.☆15,402Oct 4, 2024Updated last year
- A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统☆8,349Sep 6, 2025Updated 5 months ago
- 💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, …☆21,070Jan 29, 2026Updated last month
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production☆44,608Aug 16, 2024Updated last year
- Topic Modelling for Humans☆16,361Nov 1, 2025Updated 4 months ago
- Future versions with model training module will be maintained through a forked version here: https://github.com/seasalt-ai/snowboy☆3,348Oct 13, 2021Updated 4 years ago
- Library for fast text representation and classification.☆26,502Mar 22, 2024Updated last year
- Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker…☆9,245Feb 20, 2026Updated last week
- Speech Recognition using DeepSpeech2.☆2,139Dec 13, 2022Updated 3 years ago
- Video editing with Python☆14,372Sep 25, 2025Updated 5 months ago
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.☆32,170Sep 30, 2025Updated 5 months ago
- An Open Source Machine Learning Framework for Everyone☆193,905Updated this week
- Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.☆28,130Feb 1, 2026Updated last month
- Clone a voice in 5 seconds to generate arbitrary speech in real-time☆59,373Dec 15, 2025Updated 2 months ago
- Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.☆9,515Feb 16, 2026Updated last week
- This library provides common speech features for ASR including MFCCs and filterbank energies.☆2,421Oct 20, 2021Updated 4 years ago
- 🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal model…☆157,071Updated this week