IBM / MAX-Speech-to-Text-ConverterLinks
Converts spoken words into text form.
β76Updated 3 weeks ago
Alternatives and similar repositories for MAX-Speech-to-Text-Converter
Users that are interested in MAX-Speech-to-Text-Converter are comparing it to the libraries listed below
Sorting:
- πΈTTS recipes for different datasetsβ87Updated 2 years ago
- Create a custom Watson Speech to Text model using specialized domain dataβ60Updated 3 years ago
- A crash course for training speech recognition models using DeepSpeech.β25Updated 4 years ago
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environmentsβ102Updated 5 years ago
- Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Frameworkβ47Updated 2 years ago
- Jupyter Notebooks for creating Speech datasetsβ46Updated 6 years ago
- This repository contains all the codes used in a thesis at Information Technology University (ITU). The topic of the thesis is pronunciatβ¦β26Updated 5 years ago
- A very basic demonstration connecting speech recognition and text-to-speechβ20Updated 5 years ago
- Generate English-language text similar to the news articles in the One Billion Words data set.β26Updated 3 weeks ago
- Automatic Speech Recognition Dataset Generationβ37Updated 6 years ago
- 24-hour Automatic Speech Recognitionβ27Updated 4 years ago
- DeepSpeech based forced alignment toolβ237Updated 4 years ago
- Generate embedding vectors from audio filesβ59Updated 3 weeks ago
- Command line tool to create corpora for Common Voiceβ76Updated last year
- Pytorch implementation of Deepmind's WaveRNN modelβ119Updated 5 years ago
- Project to explore Speaker and Voice Identification. To follow will be further Speech Recognition tasks.β54Updated 6 years ago
- A simple audio feature extraction libraryβ80Updated 5 years ago
- An HTML interface for finetuning the sync map output from aeneasβ53Updated 2 years ago
- Tool for creation, manipulation and maintenance of voice corporaβ81Updated last year
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratoryβ16Updated 6 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zooβ25Updated 2 years ago
- Speaker diarization via transfer learningβ27Updated 6 years ago
- Won't it be cool to build a speech assistant like Alexa or Siri yourself without voice API and network connection?β33Updated 7 years ago
- Identify sounds in short audio clipsβ154Updated 3 weeks ago
- Python library for handling audio datasets.β138Updated last year
- Built a deep neural network that functions as part of an end-to-end automatic speech recognition (ASR) pipeline.β48Updated 6 years ago
- Mozilla deepspeech server implemented in django.β49Updated 3 years ago
- Speech noise reduction which was generated using existing post-production techniques implemented in Pythonβ178Updated 3 years ago
- Project involving Voice Signal Processing of users to recognise them using Voice Biometricsβ37Updated 6 years ago
- Tool to analyze an audio corpora in terms of intonation, intensity, duration and voice qualityβ22Updated 5 years ago