Find out who said what in the video.
☆135Jan 22, 2026Updated last month
Alternatives and similar repositories for whisperVideo
Users who are interested in whisperVideo are comparing it to the libraries listed below
- Multi-human Interactive Talking Dataset☆69Aug 6, 2025Updated 7 months ago
- [CVPR 2026] SpaceTimePilot: Generative Rendering of Dynamic Scenes Across Space and Time☆99Jan 1, 2026Updated 2 months ago
- ☆82Feb 24, 2026Updated last week
- [CVPR 2025] DoraCycle: Domain-Oriented Adaptation of Unified Generative Model in Multimodal Cycles☆30May 13, 2025Updated 9 months ago
- Cog wrapper for MagicAnimate☆31Dec 8, 2023Updated 2 years ago
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated 8 months ago
- A user-friendly Command & Control (C&C) web platform for remote monitoring, management, and task automation across multiple devices.☆14Dec 15, 2024Updated last year
- Detects rectangular documents in perspective images and rectifies them☆31Sep 16, 2022Updated 3 years ago
- A real-time streaming conversational video system that transforms text interactions into continuous, high-fidelity video responses using …☆307Dec 15, 2025Updated 2 months ago
- Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab.☆74Jan 14, 2026Updated last month
- Save Image with more file formats for ComfyUI☆34Mar 27, 2024Updated last year
- Official code for WACV 2024 paper, "Annotation-free Audio-Visual Segmentation"☆37Oct 11, 2024Updated last year
- Implementation of paper 'Helping Hands: An Object-Aware Ego-Centric Video Recognition Model'☆33Nov 7, 2023Updated 2 years ago
- Epic Games free games script that sends a webhook when a new free game is available☆11Nov 13, 2023Updated 2 years ago
- MiniMax-Provider-Verifier offers a rigorous, vendor-agnostic way to verify whether third-party deployments of the Minimax M2 model are co…☆29Feb 18, 2026Updated 2 weeks ago
- ☆37May 28, 2025Updated 9 months ago
- Animate Any Character in Any World☆90Jan 9, 2026Updated last month
- [ECCV 2024] Learning Video Context as Interleaved Multimodal Sequences☆43Mar 11, 2025Updated 11 months ago
- Enable AI to control your PC. This repo includes the WorldGUI Benchmark and GUI-Thinker Agent Framework.☆113Jul 27, 2025Updated 7 months ago
- [ICLR2026] The code for "Interp3D: Correspondence-Aware Interpolation for Generative Textured 3D Morphing."☆24Jan 21, 2026Updated last month
- SkillX.sh — The Only Skill That Your AI Agent Needs. AI agent skills marketplace with semantic search, leaderboard, ratings, and CLI.☆24Feb 13, 2026Updated 3 weeks ago
- ☆10Aug 3, 2020Updated 5 years ago
- Experience any location across time with AI-powered visualization☆28Jan 14, 2026Updated last month
- A powerful integration that combines Browserbase's Stagehand with Mastra for advanced web automation, scraping, and AI-powered web intera…☆34Updated this week
- Remove NotebookLM watermarks from slides. Local processing, no upload needed.☆37Jan 15, 2026Updated last month
- Template app using Cloudflare Workers, Hono, and Replicate to generate images using Flux Schnell☆17Feb 13, 2025Updated last year
- ☆10Apr 2, 2022Updated 3 years ago
- FastAPI API template☆11Feb 9, 2026Updated 3 weeks ago
- ☆16Sep 18, 2025Updated 5 months ago
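- A Vue scaffold for building multi-page, mobile-adapted apps; includes an autoprefixer + vw adaptation scheme suited to mobile development, hot reloading, and removal of console.log output at build time.☆10Apr 14, 2019Updated 6 years ago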
- ☆13Apr 18, 2025Updated 10 months ago
- Implementation of "Fast bilateral filtering for the display of high-dynamic-range images"☆10Aug 30, 2022Updated 3 years ago
- ☆11Mar 13, 2023Updated 2 years ago
- ☆14Jun 10, 2025Updated 8 months ago
- ⚛ opinionated electron application template☆12Jun 20, 2024Updated last year
- ☆47Feb 19, 2024Updated 2 years ago
- A speech recognition project☆49May 13, 2025Updated 9 months ago
- A collection of various book resources, including many computer science and AI books.☆11Jul 5, 2021Updated 4 years ago
- I am curating the best Black Friday and Cyber Monday deals for developers, mostly learning resources to prepare for coding and system design i…☆29Nov 26, 2025Updated 3 months ago