showlab / whisperVideoLinks
Find out who said what in the video.
☆130Updated 3 weeks ago
Alternatives and similar repositories for whisperVideo
Users that are interested in whisperVideo are comparing it to the libraries listed below
Sorting:
- Realtime Audio SDK for the Web — audio capture, echo cancellation (AEC), voice activity detection (VAD), and real-time encoding (Opus/PCM…☆123Updated 2 months ago
- [EMNLP 2025 Demo] PresentAgent: Multimodal Agent for Presentation Video Generation☆129Updated 2 months ago
- The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables"☆72Updated 5 months ago
- a super fast llm response using small llm model to prefix large llm model☆241Updated 3 weeks ago
- Chrome extension to add a link from each Arxiv page to the corresponding HF Paper page☆26Updated 2 years ago
- coze api to openai☆15Updated last year
- 如何得到最好的结果,Improve-Your-Prompt是一个用于优化prompt的prompt☆44Updated last week
- 致力于更优雅的AI生成短视频,并提供webui界面,方便创作☆80Updated 2 weeks ago
- ☆171Updated last year
- Precision Alignment, Infinite Possibilities☆117Updated last week
- Async MCP server with Minimax API integration for image generation and text-to-speech☆51Updated 2 weeks ago
- ☆167Updated last year
- ☆79Updated 9 months ago
- An AI-driven daily arXiv paper crawler, analyzer, and organizer tool, focusing on AIGC☆74Updated last week
- 🎤💬 Full example of implementing ChatGPT's realtime voice from scratch with VAD + STT + LLM + TTS technology stack within almost one fil…☆143Updated 3 months ago
- ☆81Updated this week
- Using APPL to reimplement popular algorithms for Large Language Models (LLMs) and prompts☆45Updated last year
- AI-Powered Video Retrieval & Clipping Tool☆386Updated 5 months ago
- [AAAI 2025] StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization☆227Updated last month
- AI agent that is compatible with multiple LLM models☆191Updated 3 months ago
- Model Context Protocol服务器,用于抓取微博用户信息、动态和搜索功能☆30Updated 5 months ago
- 🚀全新重构!论文阅读工具,一键截图AI翻译,支持数学公式,贴片截图,窗口锁定,归档管理☆135Updated last week
- Code for ACL25-findings. An LLM-based agent simulation framework that simulates human behavior and generates dynamic, text-based social g…☆90Updated 3 months ago
- AI tool for auto-research, TTS, and Graphical assembly into a completed Podcast☆84Updated this week
- ☆47Updated 10 months ago
- 这是一个基于 Next.js 构建的多语言 AI 模型评估平台,支持多模型对比和实时流式响应。A multilingual AI model evaluation platform built with Next.js, allowing users to compare …☆97Updated 2 months ago
- Bilibili video search MCP (Model Context Protocol) service - 哔哩哔哩视频搜索MCP服务☆141Updated 3 months ago
- The Level-Navi Agent, a framework that requires no training and utilizes large language models for deep query understanding and precise s…☆81Updated last year
- Simple script to quickly implement DDNS based on CloudFlare.☆17Updated 5 months ago
- ☆49Updated 5 months ago