groundcat / Google-AI-video-transcribe-subtitle-generator
Transcribes video using GCP speech-to-text and generates .SRT subtitles
☆15Updated last year
Related projects ⓘ
Alternatives and complementary repositories for Google-AI-video-transcribe-subtitle-generator
- An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker …☆20Updated 2 months ago
- An end to end ASR Transformer model training repo☆14Updated 2 years ago
- repo for active speaker detection for media videos.☆21Updated last year
- A Tiny Project For ASR model training and Deployment☆27Updated 2 years ago
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆50Updated 3 years ago
- Curriculum Vitae of Quan Wang☆14Updated this week
- one script for xls-r/xlsr/whisper fine-tuning☆39Updated last year
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Updated last year
- Online (real-time) decoder to be used with DeepSpeech2 model☆24Updated 4 years ago
- Transferability of cross-lingual and cross-age speech emotion recognition☆17Updated last year
- Finding the most similar tone/color in a large collection of audio. 在一大堆音频中寻找最相似的音色。☆13Updated 5 months ago
- convert subtitle (.srt) to speech (.wav) using google API☆38Updated 2 years ago
- ToneNet: A CNN Model of Tone Classification of Mandarin Chinese☆16Updated 4 years ago
- Parallel TTS web demo based on Flask + Vue (Vuetify). 基于 Flask + Vue 的语音合成单网页演示项目。☆45Updated 3 years ago
- ☆45Updated 4 months ago
- Uses machine learning to denoise audio containing speech☆29Updated 4 months ago
- Unsupervised video dubbing project☆38Updated 4 years ago
- A packaged convolutional voice activity detector for noisy environments.☆14Updated 5 years ago
- Fine-tuning Wav2Vec2.0 on Common Voice(zh-HK)☆14Updated 2 years ago
- Generate subtitle files with timelines in an automatic way.☆62Updated 2 years ago
- Split long audio files based on subtitle-info in SRT File (Transcript saved in CSV)☆18Updated 5 years ago
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with the…☆46Updated last year
- Compute useful transcriptions metrics (CER, WER, SER, ...)☆26Updated 10 years ago
- A project about learning how to synchronize subtitles in movies using machine learning.☆9Updated last year
- 基于uvr5的歌唱人声分离☆25Updated 2 years ago
- Supervoice Speaker Separation Network☆13Updated 5 months ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆28Updated 6 months ago
- chinese real time voice cloning☆39Updated 4 years ago