IBM / MAX-Speech-to-Text-Converter
Converts spoken words into text form.
★76 Updated last year
Alternatives and similar repositories for MAX-Speech-to-Text-Converter:
Users who are interested in MAX-Speech-to-Text-Converter are comparing it to the repositories listed below.
- 🐸TTS recipes for different datasets ★86 Updated 2 years ago
- Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Framework ★47 Updated last year
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory ★16 Updated 6 years ago
- Nala is an agile open-source voice assistant framework (20+ actions). ★35 Updated last year
- Create a custom Watson Speech to Text model using specialized domain data ★60 Updated 3 years ago
- A repo listing known open source voice tools, ordered by where they sit in the voice stack ★26 Updated 2 years ago
- 24-hour Automatic Speech Recognition ★27 Updated 3 years ago
- Won't it be cool to build a speech assistant like Alexa or Siri yourself without voice API and network connection? ★34 Updated 6 years ago
- A very basic demonstration connecting speech recognition and text-to-speech ★19 Updated 4 years ago
- A crash course for training speech recognition models using DeepSpeech. ★24 Updated 3 years ago
- Built a deep neural network that functions as part of an end-to-end automatic speech recognition (ASR) pipeline. ★48 Updated 6 years ago
- Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion. ★129 Updated 3 years ago
- Generate a summarized description of a body of text ★27 Updated last year
- Experiment with "one-shot learning" techniques to recognize a voice signature ★24 Updated 5 years ago
- This repository contains all the codes used in a thesis at Information Technology University (ITU). The topic of the thesis is pronunciat… ★25 Updated 5 years ago
- Web app for keyword spotting using TensorflowJS ★71 Updated 2 years ago
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments ★102 Updated 4 years ago
- automatically align transcribed audio and generate a wav2letter training corpus ★36 Updated last year
- Web-based tool for straight-forward class annotation of audio files ★11 Updated 4 years ago
- Build your own Real-time Speech Emotion Recognizer ★111 Updated 6 years ago
- Tools for working with the CMU Pronunciation Dictionary ★35 Updated 7 years ago
- ★16 Updated 4 years ago
- Live demo for speech emotion recognition using Keras and Tensorflow models ★39 Updated 7 months ago
- Automatic Speech Recognition Dataset Generation ★37 Updated 6 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x ★72 Updated 2 years ago
- TTS Client for Coqui TTS server ★13 Updated 2 years ago
- Identify sounds in short audio clips ★154 Updated last year
- Simple text to phonemes converter for multiple languages ★20 Updated 2 years ago
- Spoken Language assessment ★42 Updated 4 years ago
- Tool for creation, manipulation and maintenance of voice corpora ★81 Updated 10 months ago