NullMagic2 / SoftWhisper
SoftWhisper simplifies audio and video transcription using the powerful Whisper model. Easily select custom models, languages, and tasks, fine-tune transcription with beam size adjustment, and specify start and end times for targeted segments.
☆327Updated last week
Alternatives and similar repositories for SoftWhisper:
Users that are interested in SoftWhisper are comparing it to the libraries listed below
- AI ContentCraft is an all-in-one content creation suite that helps creators generate stories, podcast scripts, and multimedia content usi…☆329Updated 2 months ago
- ☆210Updated 3 months ago
- ☆157Updated 5 months ago
- Trans Router☆159Updated 3 months ago
- Speech to Text but with all the bells and whistles and most importantly AI! AI will clean up your filler words, edit and will refine what…☆295Updated 2 months ago
- Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet. Powered by Vercel…☆115Updated last month
- A Playwright-based Node.js tool that bypasses search engine anti-scraping mechanisms to execute Google searches. Local alternative to SER…☆201Updated last week
- A concise list for mcp servers☆618Updated 2 weeks ago
- Googles NotebookLM but local☆191Updated 2 weeks ago
- ☆59Updated last month
- A Gradio app that transcribes YouTube videos using audio extraction and OpenAI’s Whisper model.☆349Updated 6 months ago
- A real-time AI development framework leveraging WebRTC for audio and video transmission.☆114Updated 2 months ago
- Scout 是一个基于 Roo Code VS Code 扩展 设计的实验性 Agent 实现。它专注于通过模拟人类行为进行精准的网络信息收集、研究与交互,旨在将 Roo Code 转变为一个强大的 Web 研究助手。☆69Updated this week
- ☆100Updated this week
- ☆104Updated 5 months ago
- A Next.js-based AI writing assistant supporting multiple LLM APIs (OpenAI, Claude, Gemini, etc.) with rich style customization features t…☆339Updated 3 weeks ago
- Your AI Dino Pal on Menubar☆108Updated 2 months ago
- Mentis: A powerful multi-agent orchestration framework built on LangGraph.☆96Updated this week
- Open-source alternative for crowdtest.ai. Simulate how users might react to different versions of your content☆139Updated last month
- Summarize and query from a lot of heterogeneous documents. Any LLM provider, any filetype, scalable (?), WIP☆443Updated last week
- VoiceCanvas,支持Stripe支付的文本转语音系统, 支持声音克隆,支持50+语言,支持选择音色,代码100%开源☆288Updated 2 weeks ago
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...)☆238Updated last week
- o1-like Chain of Thoughts on claude-3-5-sonnet!☆75Updated 6 months ago
- 快速分享大模型生成的HTML、Markdown、SVG、Mermaid代码☆72Updated 2 weeks ago
- Markdown Conversion☆322Updated 3 weeks ago
- A web interface for MarkItDown file converter☆36Updated last month
- ☆74Updated this week
- Deeper Seeker is an simpler OSS version of OpenAI's latest Deep Research feature in ChatGPT.It is an agentic research tool to reason , cr…☆392Updated this week
- Pagetalk is a beautiful browser extension that allows you to use Gemini to read page content and have multiple conversations.☆29Updated this week
- 儿童有声读物的智能化自动化合生成,使用通义千问大模型+ Cosyvoice声音合成 + Flux 图像生成 + Paraformer 声音识别合成可用于生产的儿童有声读物☆83Updated 3 months ago