NullMagic2 / SoftWhisperLinks
SoftWhisper simplifies audio and video transcription using the powerful Whisper model. Easily select custom models, languages, and tasks, fine-tune transcription with beam size adjustment, and specify start and end times for targeted segments.
☆367Updated 2 weeks ago
Alternatives and similar repositories for SoftWhisper
Users that are interested in SoftWhisper are comparing it to the libraries listed below
Sorting:
- AI ContentCraft is an all-in-one content creation suite that helps creators generate stories, podcast scripts, and multimedia content usi…☆335Updated 4 months ago
- Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet. Powered by Vercel…☆121Updated 3 months ago
- Unlimited text-to-speech in the Browser using Kokoro-JS, 100% local, 100% open source☆154Updated 2 weeks ago
- 🧠 Curated collection of system prompts for top AI tools. Perfect for AI agent builders and prompt engineers. Incuding: ChatGPT, Claude, …☆245Updated this week
- Secretary is an AI-powered tool that analyzes social media content from specified accounts and delivers results via WeChat. It supports c…☆319Updated 2 weeks ago
- ☆234Updated 6 months ago
- Generate Web Pages and Components with text prompts, with Local Models. (or Cloud Models, if you want) - now supports Thinking Models!☆155Updated 3 weeks ago
- A real-time AI development framework leveraging WebRTC for audio and video transmission.☆131Updated 3 months ago
- A Fast TTS Engine☆506Updated 4 months ago
- Speech to Text but with all the bells and whistles and most importantly AI! AI will clean up your filler words, edit and will refine what…☆315Updated 3 months ago
- ☆156Updated 7 months ago
- Modified version of Chatterbox that accepts text files as input and no character restrictions☆65Updated this week
- ☆211Updated 4 months ago
- Pagetalk is a beautiful browser extension that allows you to use Gemini to read page content and have multiple conversations.☆136Updated this week
- Trans Router☆162Updated 4 months ago
- Gradio-powered application that converts audio recordings of meetings into transcripts and provides concise summaries using whisper.☆108Updated 8 months ago
- Self-hosted voice chat with LLMs☆431Updated 3 months ago
- Deeper Seeker is an simpler OSS version of OpenAI's latest Deep Research feature in ChatGPT.It is an agentic research tool to reason , cr…☆408Updated 3 weeks ago
- Scout 是一个基于 Roo Code VS Code 扩展 设计的实验性 Agent 实现。它专注于通过模拟人类行为进行精准的网络信息收集、研究与交互,旨在将 Roo Code 转变为一个强大的 Web 研究助手。☆96Updated last month
- Mentis: A powerful multi-agent orchestration framework built on LangGraph.☆236Updated 2 weeks ago
- A concise list for mcp servers☆674Updated 2 months ago
- Initialize any web chat with your code☆861Updated this week
- A Gradio app that transcribes YouTube videos using audio extraction and OpenAI’s Whisper model.☆355Updated 7 months ago
- ☆59Updated 3 months ago
- 44000+ 词汇语料库☆262Updated this week
- 智能故事创作项目 - 将AI概念转化为引人入胜的故事,基于专业叙事理论和创作技巧☆51Updated this week
- Googles NotebookLM but local☆267Updated last month
- ☆274Updated 3 months ago
- Open-source alternative for crowdtest.ai. Simulate how users might react to different versions of your content☆149Updated 3 months ago
- Automate desktop apps like a browser. AI-native GUI automation for Windows. Fast, reliable, agent-ready.☆522Updated this week