IBM / MAX-Speech-to-Text-ConverterLinks
Converts spoken words into text form.
☆76Updated 3 months ago
Alternatives and similar repositories for MAX-Speech-to-Text-Converter
Users that are interested in MAX-Speech-to-Text-Converter are comparing it to the libraries listed below
Sorting:
- Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Framework☆47Updated 2 years ago
- Built a deep neural network that functions as part of an end-to-end automatic speech recognition (ASR) pipeline.☆49Updated 6 years ago
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments☆102Updated 5 years ago
- A crash course for training speech recognition models using DeepSpeech.☆25Updated 4 years ago
- 🦁 Nala is an agile open-source voice assistant framework (20+ actions).☆35Updated 2 years ago
- Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.☆129Updated 4 years ago
- Experiment with "one-shot learning" techniques to recognize a voice signature☆24Updated 5 years ago
- 🔉 A web app to play, visualize, and annotate your audio files for machine learning☆120Updated 5 years ago
- 🐸TTS recipes for different datasets☆86Updated 3 years ago
- Automatic Speech Recognition Dataset Generation☆37Updated 7 years ago
- A very basic demonstration connecting speech recognition and text-to-speech☆20Updated 5 years ago
- Identify sounds in short audio clips☆155Updated 3 months ago
- Web app for keyword spotting using TensorflowJS☆73Updated 2 years ago
- Text and Punctuation correction with Deep Learning☆128Updated 5 years ago
- Deep neural network (DNN) for noise reduction, removal of background music, and speech separation☆172Updated 2 years ago
- 24-hour Automatic Speech Recognition☆27Updated 4 years ago
- A testing server for a speech to text service based on coqui.ai☆216Updated 3 years ago
- Create a custom Watson Speech to Text model using specialized domain data☆59Updated 3 years ago
- Speech noise reduction which was generated using existing post-production techniques implemented in Python☆180Updated 3 years ago
- Mozilla deepspeech server implemented in django.☆49Updated 4 years ago
- LogMMSE speech enhancement/noise reduction☆30Updated 5 years ago
- Tool for creation, manipulation and maintenance of voice corpora☆82Updated last year
- The repository contains all the codes necessary for my project - Automatic Speech Recognition System in Hindi Language ( Project descript…☆28Updated 5 years ago
- Identifying people from small audio fragments☆171Updated 5 years ago
- Jupyter Notebooks for creating Speech datasets☆46Updated 6 years ago
- Speaker diarization via transfer learning☆27Updated 6 years ago
- Generate embedding vectors from audio files☆59Updated 3 months ago
- This module aims to extract emotions from audio. The input argument is either an uploaded audio/video file to the server or a URL. The o…☆22Updated 7 years ago
- This repository contains all the codes used in a thesis at Information Technology University (ITU). The topic of the thesis is pronunciat…☆26Updated 6 years ago
- Emotion_Voice_Recognition_Chainer☆31Updated 9 years ago