Fcabla / whisper_subtitlerLinks
Generate transcriptions and subtitles using OpenAI whisper as a base model, stable-ts/whisperx as a timestamp stabilizer using ASR models and pyannote/nemo models in order to identify different speakers.
☆18Updated 2 years ago
Alternatives and similar repositories for whisper_subtitler
Users that are interested in whisper_subtitler are comparing it to the libraries listed below
Sorting:
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles☆79Updated 2 years ago
- Listen to any audio stream on your machine and print out the transcribed or translated audio.☆119Updated last year
- Effortlessly record, transcribe, and summarize meetings with this user-friendly desktop utility powered by OpenAI's Whisper and GPT-3.5-t…☆186Updated 2 years ago
- Live-Transcription (STT) with Whisper PoC☆189Updated last year
- Bulk summarization of documents using ChatGPT API☆123Updated 6 months ago
- A gradio interface for making transcribed and translated subtitles for videos☆42Updated 5 months ago
- Forked from https://huggingface.co/spaces/aadnk/faster-whisper-webui CLI to support running both transcribe and translate tasks or differ…☆18Updated last year
- ez audio transcription tool with flexible processing and post-processing options☆156Updated last year
- A curated list of awesome OpenAI's Whisper☆101Updated last year
- OpenAI Whisper API-style local server, runnig on FastAPI☆83Updated 7 months ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆222Updated 3 months ago
- The subtitles and translations are generated in real-time and displayed as pop-ups.☆171Updated 2 years ago
- web based editor for subtitles and transcripts☆137Updated 11 months ago
- ☆232Updated last year
- Subtitle to audio, generate audio from any subtitle file using Coqui-ai TTS and synchronize the audio timing according to subtitle time.☆117Updated last year
- A reverse engineered Python API wrapper for OpenPlayground (nat.dev)☆76Updated 2 years ago
- ☆257Updated 2 years ago
- Translated vocal synthesis - Clone a voice and output speech in another language☆25Updated 3 years ago
- The PDFChat app allows you to chat with your PDF files in natural language.☆137Updated 2 years ago
- AI Talking Head: create video from plain text or audio file in minutes, support up to 100+ languages and 350+ voice models.☆36Updated 2 years ago
- A Colab Notebook for OpenAI Whisper and DeepL API, aiming to create human-comparable results of translation and transcription.☆31Updated last year
- A self-hostable reverse engineering of Quora's Poe.com API, allowing free access to ChatGPT and Anthropic's Claude AI☆67Updated 2 years ago
- Powered by OpenAI Whisper & Gradio☆30Updated 2 years ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆96Updated last year
- 🎥 Youtube Video Summarizer and Question Answering App Using Whisper and Langchain☆94Updated last year
- Generate subtitles for long movies / podcasts with OpenAI Whisper API.☆30Updated last year
- ☆47Updated last year
- Reverse engineered API of Stable Diffusion XL 1.0 ( Midjourney Alternative ), A text-to-image generative AI model that creates beautiful …☆40Updated last year
- 本工具是python tkinter编写的一个简单的Gui,任务批量管理器。通过Gui选项生成*CMD*(command),来调用whisper,达到批量生成,管理的目的。支持whisper和whisperx☆58Updated last year
- A performant high-throughput CPU-based API for Meta's No Language Left Behind (NLLB) using CTranslate2, hosted on Hugging Face Spaces.☆119Updated last week