IBM / MAX-Speech-to-Text-Converter
Converts spoken words into text form.
☆76Updated 2 months ago
Alternatives and similar repositories for MAX-Speech-to-Text-Converter
Users who are interested in MAX-Speech-to-Text-Converter are comparing it to the libraries listed below.
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments☆103Updated 5 years ago
- Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Framework☆48Updated 2 years ago
- 🐸TTS recipes for different datasets☆86Updated 3 years ago
- 🦁 Nala is an agile open-source voice assistant framework (20+ actions).☆35Updated 2 years ago
- 🔉 A web app to play, visualize, and annotate your audio files for machine learning☆120Updated 5 years ago
- A crash course for training speech recognition models using DeepSpeech.☆24Updated 4 years ago
- Mozilla DeepSpeech server implemented in Django.☆49Updated 4 years ago
- Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.☆130Updated 4 years ago
- Web app for keyword spotting using TensorFlow.js☆74Updated 3 years ago
- 🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)☆223Updated 5 years ago
- Automatic Speech Recognition Dataset Generation☆37Updated 7 years ago
- 🔮 Word by word audio subtitle synchronisation tool and API. Developed under GSoC 2017 with CCExtractor.☆171Updated 6 years ago
- An HTML interface for finetuning the sync map output from aeneas☆53Updated 3 years ago
- Create a custom Watson Speech to Text model using specialized domain data☆59Updated 4 years ago
- Speaker diarization scripts, based on AaltoASR☆191Updated 6 years ago
- Build your own Real-time Speech Emotion Recognizer☆115Updated 6 years ago
- Tool for creation, manipulation and maintenance of voice corpora☆82Updated last year
- Speech noise reduction which was generated using existing post-production techniques implemented in Python☆181Updated 4 years ago
- Deep Learning - one shot learning for speaker recognition using Filter Banks☆170Updated last year
- ☆65Updated 2 years ago
- Built a deep neural network that functions as part of an end-to-end automatic speech recognition (ASR) pipeline.☆49Updated 6 years ago
- Text and Punctuation correction with Deep Learning☆128Updated 5 years ago
- Identify sounds in short audio clips☆156Updated 2 months ago
- A testing server for a speech to text service based on coqui.ai☆219Updated 3 years ago
- Deep neural network (DNN) for noise reduction, removal of background music, and speech separation☆173Updated 3 years ago
- PyTorch implementation of DeepMind's WaveRNN model☆122Updated 6 years ago
- Project to explore Speaker and Voice Identification. To follow will be further Speech Recognition tasks.☆52Updated 6 years ago
- Python SDK for the Microsoft Speaker Recognition API, part of Cognitive Services☆110Updated 3 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x☆71Updated 3 years ago
- Speaker diarization Python system based on binary key speaker modelling☆60Updated 3 years ago