abinashmeher999 / voice-data-extractLinks
A command line interface to combine text information from subtitles with voice data in the video. Provides a convenient way to generate training data for speech-recognition purposes.
☆19Updated last year
Alternatives and similar repositories for voice-data-extract
Users that are interested in voice-data-extract are comparing it to the libraries listed below
Sorting:
- A project about learning how to synchronize subtitles in movies using machine learning.☆9Updated 2 years ago
- Phonetic and phonological vocoding platform☆16Updated 8 years ago
- Zoom Audio Transcription offline☆32Updated 4 years ago
- Tools for working with the CMU Pronunciation Dictionary☆35Updated 7 years ago
- From a large speech audio file and its corresponding body of text, automatically chunk the audio and text into (phrase, audio_snippet) pa…☆17Updated 10 years ago
- Top level code to transcribe English audio/video files into text/subtitles☆20Updated 7 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆26Updated 2 years ago
- 24-hour Automatic Speech Recognition☆27Updated 4 years ago
- 🦁 Nala is an agile open-source voice assistant framework (20+ actions).☆35Updated last year
- Collaborative annotation of multimedia documents☆12Updated 7 years ago
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- Deeplearing based Reverse Image Search using Annoy library☆16Updated 6 years ago
- PurePos is an open source hybrid morphological tagger.☆16Updated 4 years ago
- INACTIVE - http://mzl.la/ghe-archive - Tools to create ARPA models from cmu pocketsphinx dictionaries for proper g2p generation☆21Updated 6 years ago
- Gentle and praatio scripts for easy forced alignment☆18Updated 2 years ago
- Documentation for the MixedEmotions Toolbox☆45Updated 7 years ago
- Interface for using TTS and vocoder models in the form of a text editor☆19Updated 2 years ago
- Customizable video storyboard generator. (Deprecated. Use https://github.com/zmwangx/metadata.)☆29Updated 6 years ago
- Creates video from TTS output and viseme images.☆12Updated 3 years ago
- Text-based media editing interface☆16Updated 7 years ago
- End-to-end deep learned Automatic Speech Recognition system☆8Updated 8 years ago
- A neural network for filtering target speaker's voice from audio written in tensorflow☆21Updated 7 years ago
- Barista is an open-source framework for concurrent speech processing.☆36Updated 11 years ago
- Experiments with Hugging Face 🔬 🤗☆44Updated 10 months ago
- ADS Project☆14Updated 9 years ago
- Simple and clean Python implementation of TextRank as per seminal paper by Rada Mihalcea and Paul Tarau. This implementation performs bot…☆11Updated 4 years ago
- ACTION: Audio-visual Cinematic Toolkit for Interaction, Organization, and Navigation☆11Updated 8 years ago
- Web app for keyword spotting using TensorflowJS☆72Updated 2 years ago
- A fork of Idiap Research Institute's DiarTk diarization toolkit☆16Updated 9 years ago
- A free dataset of (almost) all publicly available podcasts.☆133Updated 10 years ago