Fcabla / whisper_subtitlerView external linksLinks
Generate transcriptions and subtitles using OpenAI whisper as a base model, stable-ts/whisperx as a timestamp stabilizer using ASR models and pyannote/nemo models in order to identify different speakers.
☆19Mar 10, 2023Updated 2 years ago
Alternatives and similar repositories for whisper_subtitler
Users that are interested in whisper_subtitler are comparing it to the libraries listed below
Sorting:
- ☆11Feb 10, 2024Updated 2 years ago
- Real-time multi-person pose estimation☆22Oct 19, 2018Updated 7 years ago
- ☆11Jul 27, 2021Updated 4 years ago
- Official code for ICCV 2023 paper: GAIT: Generating Aesthetic Indoor Tours with Deep Reinforcement Learning☆12Dec 31, 2023Updated 2 years ago
- Script parses Interactive Brokers trade report to aid in Finnish tax report fill☆13Jan 10, 2024Updated 2 years ago
- GMOT-40: A Benchmark for Generic Multiple Object Tracking (CVPR 2021)☆40Apr 3, 2025Updated 10 months ago
- A cog implementation of mPLUG-Owl🦉, a multimodal large language model☆11May 12, 2023Updated 2 years ago
- A repository to mass generate deepfake video based on DeepFaceLab repository.☆10Aug 10, 2023Updated 2 years ago
- gradio bbox labeling tools☆11May 12, 2023Updated 2 years ago
- Urban Generative Intelligence (UGI): A Foundational Platform for Embodied Agent and Future City☆11Dec 17, 2023Updated 2 years ago
- GDPnet: "Geometry-guided Dense Perspective Network for Speech-Driven Facial Animation." (TVCG 2021)☆11Nov 21, 2021Updated 4 years ago
- TaiYiXLCheckpointLoader: An unoffical node support Taiyi-Diffusion-XL(Taiyi-XL) Chinese-English bilingual language model☆11Sep 1, 2024Updated last year
- Concurrent TikTok video downloader without watermark. (Snaptik)☆13Dec 16, 2023Updated 2 years ago
- ☆11Apr 25, 2023Updated 2 years ago
- Prompt Free, Soul Driven AI Assistant☆29Feb 8, 2026Updated last week
- a version tools. face detector,face landmark detector,face parsing and so on☆12Jul 30, 2022Updated 3 years ago
- 歇斯底里的双色球!But Chat with LLM☆11Mar 9, 2024Updated last year
- [CVPR 2025] Code for "Notes-guided MLLM Reasoning: Enhancing MLLM with Knowledge and Visual Notes for Visual Question Answering".☆20Jun 16, 2025Updated 7 months ago
- A python library that supports all vector databases specifically for LLM apps and frameworks☆13May 3, 2023Updated 2 years ago
- ☆11Mar 1, 2024Updated last year
- ☆12Mar 25, 2024Updated last year
- A cross-platform ZeroTier desktop client. Build with Tauri, Rust, Vite, React, Zustand, Next UI and Tailwind CSS☆10Oct 24, 2025Updated 3 months ago
- Docling simplifies document processing, parsing diverse formats — including advanced PDF understanding — and providing seamless integrati…☆16Feb 4, 2026Updated last week
- FastAPI Microservices Architecture SDK - As Basis for multiple services in a platform/system☆12Oct 4, 2022Updated 3 years ago
- A simple terminal emulator built with JavaScript☆15Dec 7, 2025Updated 2 months ago
- ☆13Nov 25, 2022Updated 3 years ago
- Experiments for paper untitlted☆14Jul 25, 2020Updated 5 years ago
- Source code for paper Are Human-generated Demonstrations Necessary for In-context Learning☆12Jan 21, 2024Updated 2 years ago
- This is simple implementation of MaskTrack_Box only requiring a bounding box for video object segmentation.☆10Aug 1, 2019Updated 6 years ago
- human pose estimation based on pose tensorflow☆11Dec 18, 2017Updated 8 years ago
- This is the proposal network for MultiPerson Pose Estimation.☆14Oct 21, 2017Updated 8 years ago
- Some LaTeX Tips for Writing Research Papers☆10May 30, 2016Updated 9 years ago
- This repository shows how we deploy docling on aws lambda☆27Jun 10, 2025Updated 8 months ago
- ☆14Oct 11, 2024Updated last year
- Free and open source FileSync software.自由且开源的文件同步软件☆10Feb 4, 2025Updated last year
- ☆10Dec 12, 2023Updated 2 years ago
- ☆11Jun 24, 2022Updated 3 years ago
- Code & Weights for “Learning Robust Anymodal Segmentor with Unimodal and Cross-modal Distillation”☆14Dec 6, 2024Updated last year
- Exploration of World Languages☆19Apr 5, 2024Updated last year