Uberi / speech_recognitionLinks
Speech recognition module for Python, supporting several engines and APIs, online and offline.
β8,769Updated last month
Alternatives and similar repositories for speech_recognition
Users that are interested in speech_recognition are comparing it to the libraries listed below
Sorting:
- Speech-to-Text-WaveNet : End-to-end sentence level English speech recognition based on DeepMind's WaveNet and tensorflowβ3,986Updated 3 years ago
- πSpeech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networksβ2,170Updated last year
- Python library and CLI tool to interface with Google Translate's text-to-speech APIβ2,472Updated 2 weeks ago
- kaldi-asr/kaldi is the official location of the Kaldi project.β14,911Updated last month
- A small speech recognizerβ4,135Updated last week
- Facebook AI Research's Automatic Speech Recognition Toolkitβ6,432Updated 7 months ago
- Python interface to the WebRTC Voice Activity Detectorβ2,272Updated 11 months ago
- A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)β2,977Updated last year
- Manipulate audio with a simple and easy high level interfaceβ9,432Updated 10 months ago
- End-to-end Automatic Speech Recognition for Madarian and English in Tensorflowβ2,840Updated 2 years ago
- A Python wrapper for Kaldiβ1,017Updated 5 months ago
- A TensorFlow implementation of DeepMind's WaveNet paperβ5,438Updated last year
- Speech Recognition using DeepSpeech2.β2,124Updated 2 years ago
- Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applicationsβ6,070Updated 2 months ago
- DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Rasβ¦β26,451Updated this week
- Offline Text To Speech synthesis for pythonβ2,360Updated last week
- Python interface to CMU Sphinxbase and Pocketsphinx librariesβ373Updated last year
- aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)β2,675Updated last year
- Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.β1,085Updated last year
- Python library for audio and music analysisβ7,700Updated last week
- Deep neural networks for voice conversion (voice style transfer) in Tensorflowβ3,938Updated 2 years ago
- A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Modelβ1,832Updated 3 years ago
- WaveNet vocoderβ2,357Updated last year
- MARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure javaβ2,495Updated 5 months ago
- A Speaker Recognition Systemβ677Updated 5 years ago
- Future versions with model training module will be maintained through a forked version here: https://github.com/seasalt-ai/snowboyβ3,246Updated 3 years ago
- Open-Source Large Vocabulary Continuous Speech Recognition Engineβ1,892Updated last week
- PyTorch implementation of convolutional neural networks-based text-to-speech synthesis modelsβ1,981Updated last year
- Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)β9,884Updated last year
- Data manipulation and transformation for audio signal processing, powered by PyTorchβ2,678Updated this week