Fcabla / whisper_subtitler
Generate transcriptions and subtitles using OpenAI Whisper as the base model, stable-ts/WhisperX (ASR-based) to stabilize timestamps, and pyannote/NeMo models to identify different speakers.
☆18Updated last year
Alternatives and similar repositories for whisper_subtitler:
Users who are interested in whisper_subtitler compare it to the libraries listed below.
- Create Youtube SRT with WhisperX using Google Colab☆18Updated last year
- AI Talking Head: create video from plain text or audio file in minutes, support up to 100+ languages and 350+ voice models.☆33Updated 2 years ago
- ez audio transcription tool with flexible processing and post-processing options☆141Updated 11 months ago
- Convert SubRip to speech using Microsoft Edge's TTS service☆52Updated last month
- Translated vocal synthesis - Clone a voice and output speech in another language☆23Updated 2 years ago
- Reverse-engineered API of Stable Diffusion XL 1.0 (Midjourney alternative), a text-to-image generative AI model that creates beautiful …☆41Updated last year
- Transcribe with ease :D☆14Updated last year
- A Colab Notebook for OpenAI Whisper and DeepL API, aiming to create human-comparable results of translation and transcription.☆27Updated 11 months ago
- A performant high-throughput CPU-based API for Meta's No Language Left Behind (NLLB) using CTranslate2, hosted on Hugging Face Spaces.☆101Updated this week
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆91Updated 8 months ago
- Auto-Lyrics: Lyrics transcription & alignment using Whisper and yt-dlp☆18Updated 3 months ago
- AvaChat is a real-time AI chat demo with animated talking heads. It uses large language models (GPT, API2D GPT4, Claude) as text inputs…☆86Updated 3 months ago
- Incredibly descriptive audiovisual summaries for videos☆40Updated 5 months ago
- Listen to any audio stream on your machine and print out the transcribed or translated audio.☆117Updated last year
- openai/whisper + extra features☆88Updated 2 years ago
- Archived 🚧|🌻Building ChatBot with LLMs.🌻 | Using async requests. | Multi-LLM adaptability | General-purpose large language model agent framework | Multi-persona, fully type-annotated☆40Updated last year
- Chrome/Edge BROWSER EXTENSION that can RECOGNIZE any live audio/video streaming then TRANSLATE it for FREE (using unofficial online Googl…☆86Updated 6 months ago
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles☆77Updated last year
- Subtitle to audio, generate audio from any subtitle file using Coqui-ai TTS and synchronize the audio timing according to subtitle time.☆108Updated last year
- Forked from https://huggingface.co/spaces/aadnk/faster-whisper-webui CLI to support running both transcribe and translate tasks or differ…☆19Updated last year
- FastAPI service on top of WhisperX☆66Updated this week
- Powered by OpenAI Whisper & Gradio☆30Updated 2 years ago
- Effortlessly record, transcribe, and summarize meetings with this user-friendly desktop utility powered by OpenAI's Whisper and GPT-3.5-t…☆175Updated last year
- OpenAI API and Whisper based Video Translation☆72Updated last month
- A simple Python package to easily use Meta's Massively Multilingual Speech (MMS) project☆52Updated last year
- Imagine translating your speech or anybody's speech to any language you want within minutes. Check this out...☆39Updated 5 months ago
- A reverse engineered Python API wrapper for OpenPlayground (nat.dev)☆76Updated last year
- A quick app to translate speech in real time using the Whisper API for transcribing audio, translating, and then using Google Text-to-Spe…☆30Updated 2 weeks ago
- A project about learning how to synchronize subtitles in movies using machine learning.☆9Updated last year