IBM / MAX-Speech-to-Text-Converter
Converts spoken words into text form.
☆76 · Updated last year
Alternatives and similar repositories for MAX-Speech-to-Text-Converter:
Users interested in MAX-Speech-to-Text-Converter are comparing it to the libraries listed below:
- Create a custom Watson Speech to Text model using specialized domain data ☆59 · Updated 3 years ago
- Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion. ☆129 · Updated 3 years ago
- 🐸TTS recipes for different datasets ☆85 · Updated 2 years ago
- Speaker diarization via transfer learning ☆27 · Updated 5 years ago
- ☆65 · Updated 2 years ago
- Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Framework ☆47 · Updated last year
- Word by word audio subtitle synchronisation tool and API. Developed under GSoC 2017 with CCExtractor. ☆170 · Updated 5 years ago
- Built a deep neural network that functions as part of an end-to-end automatic speech recognition (ASR) pipeline. ☆48 · Updated 6 years ago
- A Collection of Speech Corpus for ASR and TTS ☆113 · Updated 7 years ago
- Won't it be cool to build a speech assistant like Alexa or Siri yourself without voice API and network connection? ☆34 · Updated 6 years ago
- Speech Commands Recognition in PyTorch ☆34 · Updated 6 years ago
- A collection of basic python modules for spoken natural language processing ☆56 · Updated 5 years ago
- Demos, samples, and experimental code for Lingvo. ☆58 · Updated last year
- Deep neural network (DNN) for noise reduction, removal of background music, and speech separation ☆172 · Updated 2 years ago
- Command line tool to create corpora for Common Voice ☆75 · Updated 8 months ago
- Open tools and data for cloudless automatic speech recognition ☆447 · Updated 3 years ago
- 24-hour Automatic Speech Recognition ☆27 · Updated 3 years ago
- The repository contains all the code necessary for my project - Automatic Speech Recognition System in Hindi Language (Project descript… ☆28 · Updated 5 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x ☆72 · Updated 2 years ago
- Nala is an agile open-source voice assistant framework (20+ actions). ☆35 · Updated last year
- Emotive Speech generation based on DAVID: An open-source platform for real-time emotional speech transformation using pysox ☆13 · Updated 7 years ago
- Deep Learning - one shot learning for speaker recognition using Filter Banks ☆164 · Updated 7 months ago
- Scripts for training Kaldi for German speech recognition (ASR). ☆24 · Updated 4 years ago
- A crash course for training speech recognition models using DeepSpeech. ☆24 · Updated 3 years ago
- Identify sounds in short audio clips ☆153 · Updated last year
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments ☆102 · Updated 4 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks ☆64 · Updated 4 years ago
- An HTML interface for finetuning the sync map output from aeneas ☆53 · Updated 2 years ago
- Automatically align transcribed audio and generate a wav2letter training corpus ☆36 · Updated last year
- Experiment with "one-shot learning" techniques to recognize a voice signature ☆24 · Updated 4 years ago