IBM / MAX-Speech-to-Text-Converter
Converts spoken words into text form.
☆76 Updated 11 months ago
Alternatives and similar repositories for MAX-Speech-to-Text-Converter:
Users who are interested in MAX-Speech-to-Text-Converter are comparing it to the libraries listed below:
- Won't it be cool to build a speech assistant like Alexa or Siri yourself without voice API and network connection? ☆34 Updated 6 years ago
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments ☆102 Updated 4 years ago
- Speaker diarization via transfer learning ☆27 Updated 5 years ago
- 24-hour Automatic Speech Recognition ☆27 Updated 3 years ago
- Project to explore Speaker and Voice Identification. To follow will be further Speech Recognition tasks. ☆54 Updated 5 years ago
- Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Framework ☆47 Updated last year
- 🔉 A web app to play, visualize, and annotate your audio files for machine learning ☆118 Updated 4 years ago
- 🦁 Nala is an agile open-source voice assistant framework (20+ actions). ☆35 Updated last year
- Phoneme prediction from speech mel-spectrograms using RNN. ☆13 Updated 5 years ago
- SailAlign is an open-source software toolkit for robust long speech-text alignment implementing an adaptive, iterative speech recognition… ☆97 Updated 2 years ago
- This repository contains all the codes used in a thesis at Information Technology University (ITU). The topic of the thesis is pronunciat… ☆25 Updated 5 years ago
- 🐸TTS recipes for different datasets ☆85 Updated 2 years ago
- An HTML interface for finetuning the sync map output from aeneas ☆53 Updated 2 years ago
- Create a custom Watson Speech to Text model using specialized domain data ☆59 Updated 3 years ago
- Automatic Speech Recognition Dataset Generation ☆37 Updated 6 years ago
- Built a deep neural network that functions as part of an end-to-end automatic speech recognition (ASR) pipeline. ☆48 Updated 5 years ago
- Jupyter Notebooks for creating Speech datasets ☆46 Updated 5 years ago
- Tool for creation, manipulation and maintenance of voice corpora ☆81 Updated 8 months ago
- Web app for keyword spotting using TensorflowJS ☆69 Updated 2 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model ☆107 Updated 3 years ago
- Pytorch implementation of Deepmind's WaveRNN model ☆121 Updated 5 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks ☆63 Updated 4 years ago
- Emotive Speech generation based on DAVID: An open-source platform for real-time emotional speech transformation using pysox ☆13 Updated 6 years ago
- DeepSpeech based forced alignment tool ☆234 Updated 4 years ago
- Speech Commands Recognition in PyTorch ☆34 Updated 6 years ago
- Conversational AI Benchmark. ☆65 Updated last year
- Generate embedding vectors from audio files ☆57 Updated last year
- LogMMSE speech enhancement/noise reduction ☆30 Updated 4 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing. ☆100 Updated last year
- Deep Learning - one shot learning for speaker recognition using Filter Banks ☆163 Updated 6 months ago