IBM / MAX-Speech-to-Text-Converter
Converts spoken words into text form.
☆76Updated last week
Alternatives and similar repositories for MAX-Speech-to-Text-Converter
Users that are interested in MAX-Speech-to-Text-Converter are comparing it to the libraries listed below
Sorting:
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments☆102Updated 4 years ago
- Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Framework☆47Updated last year
- 🐸TTS recipes for different datasets☆87Updated 2 years ago
- An HTML interface for finetuning the sync map output from aeneas☆53Updated 2 years ago
- Create a custom Watson Speech to Text model using specialized domain data☆60Updated 3 years ago
- DeepSpeech based forced alignment tool☆237Updated 4 years ago
- speaker diarization system using an LSTM☆50Updated 2 years ago
- Web app for keyword spotting using TensorflowJS☆71Updated 2 years ago
- SailAlign is an open-source software toolkit for robust long speech-text alignment implementing an adaptive, iterative speech recognition…☆98Updated 3 years ago
- Deep Learning - one shot learning for speaker recognition using Filter Banks☆168Updated 10 months ago
- A crash course for training speech recognition models using DeepSpeech.☆25Updated 4 years ago
- Mozilla deepspeech server implemented in django.☆49Updated 3 years ago
- Project to explore Speaker and Voice Identification. To follow will be further Speech Recognition tasks.☆54Updated 6 years ago
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory☆16Updated 6 years ago
- 24-hour Automatic Speech Recognition☆27Updated 3 years ago
- 🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).☆380Updated 2 years ago
- A testing server for a speech to text service based on coqui.ai☆215Updated 2 years ago
- Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.☆129Updated 4 years ago
- Pytorch implementation of Deepmind's WaveRNN model☆121Updated 5 years ago
- A very basic demonstration connecting speech recognition and text-to-speech☆20Updated 5 years ago
- This repository is a collection of TTS Models in TFLite☆192Updated 4 years ago
- Wrapper for pydub AudioSegment objects☆96Updated 2 years ago
- Jupyter Notebooks for creating Speech datasets☆46Updated 6 years ago
- ESPnet Model Zoo☆250Updated last year
- Deep neural network (DNN) for noise reduction, removal of background music, and speech separation☆172Updated 2 years ago
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages☆470Updated 5 years ago
- Won't it be cool to build a speech assistant like Alexa or Siri yourself without voice API and network connection?☆33Updated 7 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x☆72Updated 2 years ago
- Speaker diarization python system based on binary key speaker modelling☆61Updated 3 years ago
- A simple audio feature extraction library☆80Updated 5 years ago