Speech recognition module for Python, supporting several engines and APIs, online and offline.
☆8,960 Mar 8, 2026 Updated this week
Alternatives and similar repositories for speech_recognition
Users who are interested in speech_recognition are comparing it to the libraries listed below.
- DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Ras… ☆26,739 Jun 19, 2025 Updated 8 months ago
- 🎙 Speech recognition using the TensorFlow deep learning framework, sequence-to-sequence neural networks ☆2,176 Jan 17, 2024 Updated 2 years ago
- kaldi-asr/kaldi is the official location of the Kaldi project. ☆15,339 Sep 22, 2025 Updated 5 months ago
- Python interface to CMU Sphinxbase and Pocketsphinx libraries ☆373 Jun 27, 2023 Updated 2 years ago
- A small speech recognizer ☆4,277 Updated this week
- Manipulate audio with a simple and easy high level interface ☆9,744 Jul 26, 2025 Updated 7 months ago
- End-to-end Automatic Speech Recognition for Mandarin and English in TensorFlow ☆2,839 Mar 24, 2023 Updated 2 years ago
- Speech-to-Text-WaveNet : End-to-end sentence level English speech recognition based on DeepMind's WaveNet and TensorFlow ☆4,012 Oct 8, 2021 Updated 4 years ago
- Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications ☆6,230 Aug 4, 2025 Updated 7 months ago
- 💫 Industrial-strength Natural Language Processing (NLP) in Python ☆33,315 Updated this week
- End-to-End Speech Processing Toolkit ☆9,755 Mar 5, 2026 Updated last week
- Python library for audio and music analysis ☆8,240 Feb 20, 2026 Updated 3 weeks ago
- Facebook AI Research's Automatic Speech Recognition Toolkit ☆6,446 Jan 12, 2026 Updated 2 months ago
- Offline Text To Speech synthesis for Python ☆2,491 Mar 2, 2026 Updated last week
- Robust Speech Recognition via Large-Scale Weak Supervision ☆95,882 Dec 15, 2025 Updated 2 months ago
- Deep Learning for humans ☆63,977 Updated this week
- The world's simplest facial recognition API for Python and the command line ☆56,183 Aug 21, 2024 Updated last year
- A PyTorch-based Speech Toolkit ☆11,302 Mar 1, 2026 Updated last week
- Models and examples built with TensorFlow ☆77,691 Mar 6, 2026 Updated last week
- Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node ☆14,345 Feb 22, 2026 Updated 2 weeks ago
- Python library and CLI tool to interface with Google Translate's text-to-speech API ☆2,592 Dec 15, 2025 Updated 2 months ago
- ChatterBot is a machine learning, conversational dialog engine for creating chat bots ☆14,483 Feb 17, 2026 Updated 3 weeks ago
- Face recognition with deep neural networks. ☆15,407 Oct 4, 2024 Updated last year
- A Deep-Learning-Based Chinese Speech Recognition System ☆8,360 Sep 6, 2025 Updated 6 months ago
- 💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, … ☆21,086 Jan 29, 2026 Updated last month
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production ☆44,763 Aug 16, 2024 Updated last year
- Topic Modelling for Humans ☆16,373 Nov 1, 2025 Updated 4 months ago
- Future versions with model training module will be maintained through a forked version here: https://github.com/seasalt-ai/snowboy ☆3,351 Oct 13, 2021 Updated 4 years ago
- Library for fast text representation and classification. ☆26,504 Mar 22, 2024 Updated last year
- Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker… ☆9,318 Updated this week
- Speech Recognition using DeepSpeech2. ☆2,140 Dec 13, 2022 Updated 3 years ago
- Video editing with Python ☆14,416 Mar 7, 2026 Updated last week
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python. ☆32,191 Sep 30, 2025 Updated 5 months ago
- An Open Source Machine Learning Framework for Everyone ☆194,137 Updated this week
- Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object. ☆28,148 Mar 1, 2026 Updated last week
- Clone a voice in 5 seconds to generate arbitrary speech in real-time ☆59,512 Updated this week
- Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more. ☆9,514 Mar 3, 2026 Updated last week
- This library provides common speech features for ASR including MFCCs and filterbank energies. ☆2,422 Oct 20, 2021 Updated 4 years ago
- 🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal model… ☆157,783 Updated this week