groundcat / Google-AI-video-transcribe-subtitle-generatorLinks
Transcribes video using GCP speech-to-text and generates .SRT subtitles
☆16Updated 2 years ago
Alternatives and similar repositories for Google-AI-video-transcribe-subtitle-generator
Users that are interested in Google-AI-video-transcribe-subtitle-generator are comparing it to the libraries listed below
Sorting:
- Deploy DL/ ML inference pipelines with minimal extra code.☆99Updated 11 months ago
- This repository summaries publications on Recognition of Handwritten Mathematical Expressions☆15Updated 7 years ago
- A streamlit application that lets you explore the effect of different audio augmentation techniques☆28Updated 3 years ago
- Transferability of cross-lingual and cross-age speech emotion recognition☆19Updated 2 years ago
- An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker …☆20Updated last year
- Detecting segments belonging to which song in database, and return Nil if does not exist in a database.☆22Updated 4 years ago
- 用 OCR 提取视频硬字幕☆81Updated 8 months ago
- Implementation of the DocLLM paper for Llama models.☆13Updated 6 months ago
- Sample implementation of natural language image search with OpenAI's CLIP and Elasticsearch or Opensearch.☆73Updated 3 years ago
- ToneNet: A CNN Model of Tone Classification of Mandarin Chinese☆20Updated 5 years ago
- Comprehensive Python library for speech and voice.☆32Updated 2 years ago
- Curriculum Vitae of Quan Wang☆15Updated last month
- Compute useful transcriptions metrics (CER, WER, SER, ...)☆27Updated 10 years ago
- Generate transcriptions and subtitles using OpenAI whisper as a base model, stable-ts/whisperx as a timestamp stabilizer using ASR models…☆18Updated 2 years ago
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles☆80Updated 2 years ago
- A packaged convolutional voice activity detector for noisy environments.☆14Updated 6 years ago
- Best Collection of Articles and code for Audio Classification☆15Updated 6 years ago
- Using open-source LLM Llama2 by Meta on local CPU inference for document question-and-answer☆15Updated 2 years ago
- Parallel TTS web demo based on Flask + Vue (Vuetify). 基于 Flask + Vue 的语音合成单网页演示项目。☆48Updated 4 years ago
- A gradio interface for making transcribed and translated subtitles for videos☆42Updated 8 months ago
- SpeechYOLO Interspeech 2019☆44Updated 3 years ago
- Presenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.☆23Updated 5 years ago
- one script for xls-r/xlsr/whisper fine-tuning☆42Updated 2 years ago
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation☆42Updated 2 years ago
- convert spleeter pretrained model to pytorch and onnx, then convert to mnn☆20Updated 4 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Updated 2 years ago
- chinese real time voice cloning☆38Updated 5 years ago
- Segmenting a given document using recursive xy-cut algorithm.☆12Updated 7 years ago
- Clone a voice in 5 seconds to generate arbitrary speech in real-time☆34Updated 5 years ago
- Speaker prediction for captions on the Lex Fridman podcast☆27Updated last year