abinashmeher999 / voice-data-extract
A command line interface to combine text information from subtitles with voice data in the video. Provides a convenient way to generate training data for speech-recognition purposes.
☆19Updated last year
Alternatives and similar repositories for voice-data-extract:
Users that are interested in voice-data-extract are comparing it to the libraries listed below
- A project about learning how to synchronize subtitles in movies using machine learning.☆9Updated 2 years ago
- Phonetic and phonological vocoding platform☆16Updated 8 years ago
- Gentle and praatio scripts for easy forced alignment☆18Updated 2 years ago
- Interface for using TTS and vocoder models in the form of a text editor☆20Updated 2 years ago
- Text-based media editing interface☆16Updated 7 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated last year
- From a large speech audio file and its corresponding body of text, automatically chunk the audio and text into (phrase, audio_snippet) pa…☆17Updated 9 years ago
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- An automatic movie trailer generator.☆41Updated 2 years ago
- Practice your speech level in any language using speech recognition☆33Updated 2 years ago
- 24-hour Automatic Speech Recognition☆27Updated 3 years ago
- ADS Project☆14Updated 9 years ago
- An online audio-to-text transcription tool☆15Updated 7 years ago
- 🎶 Create sounds, notes, and music with machine learning algorithms interactively.☆14Updated 2 years ago
- Tools for working with the CMU Pronunciation Dictionary☆35Updated 7 years ago
- 🦁 Nala is an agile open-source voice assistant framework (20+ actions).☆35Updated last year
- Experiments with generating GPT-2 fanfiction on specified topics.☆11Updated 5 years ago
- Barista is an open-source framework for concurrent speech processing.☆36Updated 11 years ago
- Emotive Speech generation based on DAVID: An open-source platform for real-time emotional speech transformation using pysox☆13Updated 7 years ago
- Allows you to edit videos automatically☆44Updated 3 years ago
- Deeplearing based Reverse Image Search using Annoy library☆17Updated 6 years ago
- A curated list of papers exploring the limits of deep learning for NLP☆23Updated 7 years ago
- Streamlit app to visualize and edit TTS datasets☆14Updated 3 years ago
- ☆14Updated last year
- Resources on AI applications in the music domain☆16Updated 2 weeks ago
- Demonstration of gpt-2 model with flask+uwsgi+nginx in web environment containerized in docker for quick deployment.☆13Updated last year
- project trying to replicate http://arxiv.org/pdf/1412.5567v2.pdf☆12Updated 9 years ago
- A free & open tool for transcribing audio interviews with offline ASR support☆24Updated last year
- A very basic demonstration connecting speech recognition and text-to-speech☆19Updated 4 years ago
- Deepiracy - Video piracy detection by using neural networks and string algorithms.☆33Updated 6 years ago