Uberi / speech_recognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
★8,445, updated last week
Related projects
Alternatives and complementary repositories for speech_recognition
- Speech-to-Text-WaveNet: End-to-end sentence-level English speech recognition based on DeepMind's WaveNet and TensorFlow (★3,954, updated 3 years ago)
- Speech recognition using the TensorFlow deep learning framework, sequence-to-sequence neural networks (★2,166, updated 10 months ago)
- A small speech recognizer (★3,950, updated last month)
- End-to-end Automatic Speech Recognition for Mandarin and English in TensorFlow (★2,845, updated last year)
- Manipulate audio with a simple and easy high level interface (★8,956, updated 3 months ago)
- DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Ras… (★25,366, updated 2 months ago)
- Python library for audio and music analysis (★7,184, updated last month)
- Offline Text To Speech synthesis for python (★2,141, updated this week)
- Video editing with Python (★12,595, updated 3 months ago)
- aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment) (★2,524, updated 4 months ago)
- Facebook AI Research's Automatic Speech Recognition Toolkit (★6,390, updated 3 months ago)
- Speech Recognition using DeepSpeech2. (★2,104, updated last year)
- Python module installed with setup.py (★340, updated 2 years ago)
- This library provides common speech features for ASR including MFCCs and filterbank energies. (★2,376, updated 3 years ago)
- kaldi-asr/kaldi is the official location of the Kaldi project. (★14,298, updated last month)
- Python library and CLI tool to interface with Google Translate's text-to-speech API (★2,314, updated this week)
- Python interface to the WebRTC Voice Activity Detector (★2,068, updated 4 months ago)
- Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker… (★6,345, updated this week)
- Deep neural networks for voice conversion (voice style transfer) in TensorFlow (★3,923, updated 2 years ago)
- Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framework. (★1,073, updated 5 months ago)
- Python interface to CMU Sphinxbase and Pocketsphinx libraries (★374, updated last year)
- Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node (★8,137, updated last week)
- Examples of how to use or integrate DeepSpeech (★821, updated last year)
- An open source library for deep learning end-to-end dialog systems and chatbots. (★6,730, updated this week)
- pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch,… (★2,367, updated 2 years ago)
- A TensorFlow implementation of DeepMind's WaveNet paper (★5,415, updated last year)
- Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications (★5,887, updated 7 months ago)
- TensorFlow CNN for fast style transfer ⚡🖥🎨🖼 (★10,929, updated last year)
- Attempt at tracking the state of the art and recent results (bibliography) on speech recognition. (★1,859, updated 2 years ago)
- Face recognition with deep neural networks. (★15,145, updated last month)