NullMagic2 / SoftWhisperLinks
SoftWhisper simplifies audio and video transcription using the powerful Whisper model. Easily select custom models, languages, and tasks, fine-tune transcription with beam size adjustment, and specify start and end times for targeted segments.
☆399Updated 3 weeks ago
Alternatives and similar repositories for SoftWhisper
Users that are interested in SoftWhisper are comparing it to the libraries listed below
Sorting:
- AI ContentCraft is an all-in-one content creation suite that helps creators generate stories, podcast scripts, and multimedia content usi…☆371Updated last month
- Speech to Text but with all the bells and whistles and most importantly AI! AI will clean up your filler words, edit and will refine what…☆319Updated 6 months ago
- Gradio-powered application that converts audio recordings of meetings into transcripts and provides concise summaries using whisper.☆125Updated 10 months ago
- A Gradio app that transcribes YouTube videos using audio extraction and OpenAI’s Whisper model.☆361Updated 10 months ago
- Trans Router☆166Updated 7 months ago
- ☆210Updated 7 months ago
- ☆282Updated 6 months ago
- Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet. Powered by Vercel…☆122Updated 6 months ago
- Summarize and query from a lot of heterogeneous documents. Any LLM provider, any filetype, advanced RAG, advanced summaries, scriptable, …☆477Updated 2 weeks ago
- AI-Powered Video Retrieval & Clipping Tool☆320Updated this week
- Secretary is an AI-powered tool that analyzes social media content from specified accounts and delivers results via WeChat. It supports c…☆336Updated 3 weeks ago
- ☆164Updated 9 months ago
- o1-like Chain of Thoughts on claude-3-5-sonnet!☆77Updated 11 months ago
- ☆104Updated 3 weeks ago
- A Fast TTS Engine☆537Updated 7 months ago
- a super fast llm response using small llm model to prefix large llm model☆230Updated 3 weeks ago
- A real-time Agent framework for audio and video.☆149Updated 2 months ago
- Open-source alternative for crowdtest.ai. Simulate how users might react to different versions of your content☆154Updated 5 months ago
- Unlimited text-to-speech in the Browser using Kokoro-JS, 100% local, 100% open source☆290Updated 2 months ago
- Your AI Dino Pal on Menubar☆109Updated 7 months ago
- AI视频剪辑☆201Updated last week
- ☆255Updated 9 months ago
- PageTalk is a beautiful browser extension that allows you to use Gemini to read page content and have multiple conversations.☆222Updated 2 weeks ago
- Mentis: A powerful multi-agent orchestration framework built on LangGraph.☆277Updated 3 months ago
- This project provides a powerful web scraping tool that fetches search results and converts them into Markdown format using FastAPI, Sear…☆224Updated 8 months ago
- ☆156Updated 2 months ago
- Self-host the powerful Chatterbox TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible),…☆458Updated last month
- Scout 是一个基于 Roo Code VS Code 扩展 设计的 实验性 Agent 实现。它专注于通过模拟人类行为进行精准的网络信息收集、研究与交互,旨在将 Roo Code 转变为一个强大的 Web 研究助手。☆115Updated 4 months ago
- ☆59Updated 5 months ago
- Deeper Seeker is an simpler OSS version of OpenAI's latest Deep Research feature in ChatGPT.It is an agentic research tool to reason , cr…☆411Updated 3 months ago