groundcat / Google-AI-video-transcribe-subtitle-generator
Transcribes video using GCP speech-to-text and generates .SRT subtitles
☆16Updated 2 years ago
Alternatives and similar repositories for Google-AI-video-transcribe-subtitle-generator
Users who are interested in Google-AI-video-transcribe-subtitle-generator are comparing it to the libraries listed below
- Generate subtitle files with timelines in an automatic way.☆62Updated 3 years ago
- An end to end ASR Transformer model training repo☆13Updated 4 years ago
- An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker …☆20Updated last year
- Generate transcriptions and subtitles using OpenAI whisper as a base model, stable-ts/whisperx as a timestamp stabilizer using ASR models…☆19Updated 2 years ago
- A packaged convolutional voice activity detector for noisy environments.☆14Updated 6 years ago
- Transferability of cross-lingual and cross-age speech emotion recognition☆20Updated 2 years ago
- Deploy DL/ ML inference pipelines with minimal extra code.☆102Updated last year
- Online (real-time) decoder to be used with DeepSpeech2 model☆25Updated 5 years ago
- Project page for our paper "DurIAN : DurIAN-SC: Duration Informed Attention Network based Singing Voice Conversion System".☆10Updated 5 years ago
- SpeechYOLO Interspeech 2019☆46Updated 3 years ago
- Curriculum Vitae of Quan Wang☆15Updated 2 weeks ago
- ToneNet: A CNN Model of Tone Classification of Mandarin Chinese☆20Updated 6 years ago
- A streamlit application that lets you explore the effect of different audio augmentation techniques☆28Updated 3 years ago
- Fine-tuning Wav2Vec2.0 on Common Voice(zh-HK)☆16Updated 3 years ago
- This repository summarizes publications on Recognition of Handwritten Mathematical Expressions☆15Updated 8 years ago
- Extract hard-coded video subtitles using OCR☆83Updated this week
- Comprehensive Python library for speech and voice.☆32Updated 3 years ago
- Voice Conversion by CycleGAN (voice cloning/voice conversion): CycleGAN-VC3☆153Updated 3 years ago
- convert spleeter pretrained model to pytorch and onnx, then convert to mnn☆20Updated 5 years ago
- flask+tornado based NVIDIA tacotron2+waveglow tts web app☆29Updated 2 years ago
- A modified version of vid2vid for Speech2Video, Text2Video Paper☆36Updated 2 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Updated 2 years ago
- A gradio interface for making transcribed and translated subtitles for videos☆42Updated 10 months ago
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Updated 2 years ago
- Convert ppt to video with audio track, using text to speech synthesis☆66Updated 7 years ago
- A Tiny Project For ASR model training and Deployment☆27Updated 3 years ago
- Unsupervised video dubbing project☆40Updated 5 years ago
- Parallel TTS web demo based on Flask + Vue (Vuetify).☆48Updated 4 years ago
- A complete end-to-end Deep Learning system to generate high quality human like speech in English for Korean Drama (WIP)☆13Updated 3 years ago
- A demo of a Chinese (zh) text-to-speech system that runs on CPU in real time.☆181Updated 3 years ago