NullMagic2 / SoftWhisper
SoftWhisper simplifies audio and video transcription using the powerful Whisper model. Easily select custom models, languages, and tasks, fine-tune transcription with beam size adjustment, and specify start and end times for targeted segments.
☆379Updated 3 weeks ago
Alternatives and similar repositories for SoftWhisper
Users who are interested in SoftWhisper are comparing it to the libraries listed below:
- AI ContentCraft is an all-in-one content creation suite that helps creators generate stories, podcast scripts, and multimedia content usi…☆338Updated last week
- Self-host the powerful Chatterbox TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible),…☆310Updated last week
- Unlimited text-to-speech in the Browser using Kokoro-JS, 100% local, 100% open source☆183Updated 2 weeks ago
- ☆156Updated 7 months ago
- Speech to Text but with all the bells and whistles and most importantly AI! AI will clean up your filler words, edit and will refine what…☆314Updated 4 months ago
- Gradio-powered application that converts audio recordings of meetings into transcripts and provides concise summaries using whisper.☆114Updated 8 months ago
- Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet. Powered by Vercel…☆121Updated 4 months ago
- Trans Router☆162Updated 5 months ago
- A Fast TTS Engine☆514Updated 5 months ago
- A Gradio app that transcribes YouTube videos using audio extraction and OpenAI’s Whisper model.☆356Updated 8 months ago
- A real-time Agent framework for audio and video.☆137Updated last week
- ☆215Updated 2 weeks ago
- ☆212Updated 5 months ago
- ☆59Updated 3 months ago
- Secretary is an AI-powered tool that analyzes social media content from specified accounts and delivers results via WeChat. It supports c…☆327Updated last month
- Self-hosted voice chat with LLMs☆432Updated 3 months ago
- AI-Powered Video Retrieval & Clipping Tool☆185Updated this week
- Open-source alternative to crowdtest.ai. Simulate how users might react to different versions of your content☆152Updated 3 months ago
- Mentis: A powerful multi-agent orchestration framework built on LangGraph.☆246Updated last month
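- Scout is an experimental Agent implementation built on the Roo Code VS Code extension. It focuses on precise web information gathering, research, and interaction by simulating human behavior, aiming to turn Roo Code into a powerful web research assistant.☆96Updated 2 months ago
- Google's NotebookLM, but local☆291Updated 2 months ago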
- Modified version of Chatterbox that accepts text files as input and has no character restrictions☆278Updated this week
- AI prompt for unlocking 92.25% of all paywalls☆52Updated 4 months ago
- Pagetalk is a beautiful browser extension that allows you to use Gemini to read page content and have multiple conversations.☆182Updated this week
- ☆96Updated last week
- A Playwright-based Node.js tool that bypasses search engine anti-scraping mechanisms to execute Google searches. Local alternative to SER…☆269Updated 2 months ago
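- Intelligent, automated generation of children's audiobooks, using the Tongyi Qianwen large model + Cosyvoice speech synthesis + Flux image generation + Paraformer speech recognition to produce production-ready children's audiobooks☆88Updated 5 months ago
- Corpus of 44,000+ vocabulary words☆353Updated 2 weeks ago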
- ☆237Updated 7 months ago
- ☆80Updated this week