IBM / MAX-Speech-to-Text-Converter
Converts spoken words into text form.
☆76Updated 9 months ago
Related projects ⓘ
Alternatives and complementary repositories for MAX-Speech-to-Text-Converter
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments☆101Updated 4 years ago
- This repository contains all the codes used in a thesis at Information Technology University (ITU). The topic of the thesis is pronunciat…☆25Updated 5 years ago
- 🐸TTS recipes for different datasets☆84Updated 2 years ago
- An HTML interface for finetuning the sync map output from aeneas☆53Updated 2 years ago
- Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Framework☆47Updated last year
- [deprecated] Pretrained models for pyannote-audio 1.x☆71Updated 2 years ago
- A crash course for training speech recognition models using DeepSpeech.☆24Updated 3 years ago
- Code for AccentDB.☆19Updated 3 years ago
- Command line tool to create corpora for Common Voice☆75Updated 5 months ago
- Automatically constructing corpus for automatic speech recognition from YouTube videos☆153Updated 4 years ago
- Compute useful transcriptions metrics (CER, WER, SER, ...)☆26Updated 10 years ago
- A simple web interface for building voice assistants with Rasa☆174Updated last year
- Gentle and praatio scripts for easy forced alignment☆18Updated 2 years ago
- The repository contains all the codes necessary for my project - Automatic Speech Recognition System in Hindi Language ( Project descript…☆28Updated 4 years ago
- Simple text to phonemes converter for multiple languages☆20Updated 2 years ago
- Grapheme To Phoneme☆70Updated 3 months ago
- 24-hour Automatic Speech Recognition☆27Updated 3 years ago
- Mozilla deepspeech server implemented in django.☆49Updated 3 years ago
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory☆16Updated 5 years ago
- Tool for creation, manipulation and maintenance of voice corpora☆81Updated 6 months ago
- Wheels for tensorflow and DeepSpeech compiled for NVidia Jetson Nano (arm64)☆89Updated 3 years ago
- VCTK multi-speaker tacotron for ICASSP 2020☆265Updated 2 years ago
- Create a custom Watson Speech to Text model using specialized domain data☆59Updated 3 years ago
- Text-to-Speech Synthesis by Generating Spectrograms using Generative Adversarial Network☆10Updated 5 years ago
- Spoken Language Identification on Common Voice and AudioSet using Deep Learning☆36Updated 2 years ago
- Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.☆128Updated 3 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks☆62Updated 4 years ago
- 🦁 Nala is an agile open-source voice assistant framework (20+ actions).☆35Updated last year
- Mispronunciation detection code for jingju singing voice☆20Updated 6 years ago
- speaker diarization system using an LSTM☆49Updated last year