IBM / MAX-Speech-to-Text-ConverterLinks
Converts spoken words into text form.
β75Updated last month
Alternatives and similar repositories for MAX-Speech-to-Text-Converter
Users that are interested in MAX-Speech-to-Text-Converter are comparing it to the libraries listed below
Sorting:
- πΈTTS recipes for different datasetsβ87Updated 2 years ago
- π A web app to play, visualize, and annotate your audio files for machine learningβ120Updated 5 years ago
- Create a custom Watson Speech to Text model using specialized domain dataβ60Updated 3 years ago
- Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Frameworkβ47Updated 2 years ago
- Project involving Voice Signal Processing of users to recognise them using Voice Biometricsβ37Updated 6 years ago
- Project to explore Speaker and Voice Identification. To follow will be further Speech Recognition tasks.β54Updated 6 years ago
- A very basic demonstration connecting speech recognition and text-to-speechβ20Updated 5 years ago
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environmentsβ102Updated 5 years ago
- A crash course for training speech recognition models using DeepSpeech.β25Updated 4 years ago
- Built a deep neural network that functions as part of an end-to-end automatic speech recognition (ASR) pipeline.β48Updated 6 years ago
- Deep Learning - one shot learning for speaker recognition using Filter Banksβ168Updated last year
- Won't it be cool to build a speech assistant like Alexa or Siri yourself without voice API and network connection?β33Updated 7 years ago
- This repository contains all the codes used in a thesis at Information Technology University (ITU). The topic of the thesis is pronunciatβ¦β26Updated 6 years ago
- Identify sounds in short audio clipsβ155Updated last month
- SailAlign is an open-source software toolkit for robust long speech-text alignment implementing an adaptive, iterative speech recognitionβ¦β98Updated 3 years ago
- voice services stack from audio hardware through hotword, ASR, NLU, AI routing and TTS bound by messaging protocol over MQTTβ94Updated last year
- Generate embedding vectors from audio filesβ59Updated last month
- Automatic Speech Recognition Dataset Generationβ37Updated 6 years ago
- π¦ Nala is an agile open-source voice assistant framework (20+ actions).β35Updated last year
- Speech Commands Recognition in PyTorchβ34Updated 6 years ago
- Speech noise reduction which was generated using existing post-production techniques implemented in Pythonβ179Updated 3 years ago
- Speaker diarization python system based on binary key speaker modellingβ60Updated 3 years ago
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratoryβ16Updated 6 years ago
- Experiment with "one-shot learning" techniques to recognize a voice signatureβ24Updated 5 years ago
- Mozilla deepspeech server implemented in django.β49Updated 4 years ago
- The repository contains all the codes necessary for my project - Automatic Speech Recognition System in Hindi Language ( Project descriptβ¦β28Updated 5 years ago
- β83Updated 4 years ago
- [deprecated] Pretrained models for pyannote-audio 1.xβ72Updated 2 years ago
- Tutorial to run TensorFlow 2 on mobile devices: Android, iOS and Browserβ30Updated 2 years ago
- Web app for keyword spotting using TensorflowJSβ72Updated 2 years ago