groundcat / Google-AI-video-transcribe-subtitle-generator
Transcribes video using GCP speech-to-text and generates .SRT subtitles
☆16 Updated 2 years ago
Alternatives and similar repositories for Google-AI-video-transcribe-subtitle-generator
Users who are interested in Google-AI-video-transcribe-subtitle-generator are comparing it to the repositories listed below
- Deploy DL/ML inference pipelines with minimal extra code. ☆102 Updated last year
- A Gradio interface for making transcribed and translated subtitles for videos ☆42 Updated 11 months ago
- An open source NLP-as-a-service project focused on providing state-of-the-art systems with ease. Training and inference by simple docker … ☆20 Updated last year
- An end-to-end ASR Transformer model training repo ☆13 Updated 4 years ago
- Google Chrome extension that lets you detect photoshopped images using a CNN. ☆11 Updated 7 years ago
- Automatically generate subtitle files with timelines. ☆62 Updated 3 years ago
- A collection of pretrained models: links to pretrained models in NLP and voice. ☆23 Updated 6 years ago
- Transferability of cross-lingual and cross-age speech emotion recognition ☆20 Updated 2 years ago
- A PyTorch demo of the paper "Voice Separation with an Unknown Number of Multiple Speakers" using Gradio and the NVIDIA NeMo ASR model. ☆37 Updated 2 years ago
- Convert the Spleeter pretrained model to PyTorch and ONNX, then convert it to MNN ☆20 Updated 5 years ago
- Sample implementation of natural language image search with OpenAI's CLIP and Elasticsearch or OpenSearch. ☆73 Updated 3 years ago
- Generate transcriptions and subtitles using OpenAI Whisper as a base model, stable-ts/whisperx as a timestamp stabilizer using ASR models… ☆19 Updated 2 years ago
- The YouTube Text-To-Speech dataset comprises waveform audio extracted from YouTube videos alongside their English transcriptions ☆52 Updated 4 years ago
- Unsupervised video dubbing project ☆40 Updated 5 years ago
- Convert PPT to video with an audio track, using text-to-speech synthesis ☆68 Updated 7 years ago
- Project page for our paper "DurIAN : DurIAN-SC: Duration Informed Attention Network based Singing Voice Conversion System". ☆10 Updated 5 years ago
- Detects which song in a database a segment belongs to, and returns Nil if the song does not exist in the database. ☆22 Updated 4 years ago
- A complete end-to-end deep learning system to generate high-quality, human-like speech in English for Korean drama (WIP) ☆13 Updated 3 years ago
- openai/whisper + extra features ☆89 Updated 3 years ago
- Generative voice cloning model using TTS synthesis with state-of-the-art zero-shot multi-speaker functionality. A web API built with the… ☆47 Updated 3 years ago
- Cloned repository from Hugging Face Spaces (CVPR 2022 Demo) ☆53 Updated 3 years ago
- A Streamlit application that lets you explore the effect of different audio augmentation techniques ☆28 Updated 3 years ago
- Predict the speaker's gender from an audio file (Flask API included) ☆20 Updated 2 years ago
- Download YouTube subtitles (closed captions, CC) as TXT or JSON, with support for translation and proxies. Available on pip 🐍. Try it online at goo… ☆72 Updated 2 years ago
- A sketch extractor for anime/illustration. ☆18 Updated 4 years ago
- Daybreak app release ☆14 Updated 2 years ago
- Code for OpenAI Whisper Web App Demo ☆93 Updated 3 years ago
- A minimalistic automatic speech recognition Streamlit-based web app powered by OpenAI's Whisper "State of the Art" models ☆67 Updated 3 years ago
- Autonomous video editing powered by computer vision and motion detection ☆17 Updated 2 years ago
- Comprehensive Python library for speech and voice. ☆32 Updated 3 years ago