dimastatz / whisper-flow
Whisper-Flow is a framework designed to enable real-time transcription of audio content using OpenAI’s Whisper model. Rather than processing entire files after upload (“batch mode”), Whisper-Flow accepts a continuous stream of audio chunks and produces incremental transcripts immediately.
☆503 Updated 11 months ago
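The difference between batch and streaming operation described above can be sketched in a few lines of Python. This is only an illustration, not Whisper-Flow's actual API: `stream_transcripts` and the `transcribe` callable are hypothetical names, and re-transcribing the growing buffer after every chunk is an assumed strategy chosen for simplicity.

```python
from typing import Callable, Iterable, Iterator


def stream_transcripts(
    chunks: Iterable[bytes],
    transcribe: Callable[[bytes], str],
) -> Iterator[str]:
    """Yield a partial transcript after every incoming audio chunk.

    `transcribe` is a hypothetical stand-in for a speech-to-text call
    (for example, a wrapper around a Whisper model); it is not part of
    any real library API.
    """
    buffer = bytearray()
    for chunk in chunks:
        # Append the new audio to everything received so far.
        buffer.extend(chunk)
        # Re-run the transcriber on the accumulated audio, so the caller
        # gets an updated partial result without waiting for the stream to end.
        yield transcribe(bytes(buffer))


def fake_transcribe(audio: bytes) -> str:
    # Placeholder transcriber for demonstration: reports how much audio it saw.
    return f"{len(audio)} bytes heard"


if __name__ == "__main__":
    for partial in stream_transcripts([b"ab", b"cde"], fake_transcribe):
        print(partial)
```

In batch mode the transcriber would be called once on the complete file; here it is called once per chunk, which is what makes incremental results possible.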
Alternatives and similar repositories for whisper-flow
Users that are interested in whisper-flow are comparing it to the libraries listed below
- Press shortcut → speak → get text. Free and open source ❤️ ☆247 Updated 4 months ago
- AI writing agent powered by kimi-k2-thinking - autonomously creates novels and stories with deep reasoning ☆520 Updated 2 months ago
- AI writing agent powered by gemini 3 flash - autonomously creates novels and stories with deep reasoning ☆250 Updated last month
- The only general AI agent that does NOT require an extra API key, giving you full control over your local and remote macOS from Claude Deskto… ☆450 Updated 7 months ago
- MCP server retrieving transcripts of YouTube videos ☆290 Updated this week
- ☆401 Updated last month
- AI agents platform that gives you a workspace with an integrated team of personal assistants that can work behind the scenes to handle da… ☆193 Updated 6 months ago
- An OS for your agents, built for your pocket. ☆795 Updated 3 months ago
- Local Groq Desktop chat app with MCP support ☆383 Updated this week
- Make your meetings accessible to AI Agents ☆421 Updated 2 months ago
- 🔥 Visual AI research assistant that displays real-time thinking, provides split-view analysis, and automatic citations using Claude and … ☆597 Updated 6 months ago
- An agent that uses OpenAI's Agents SDK to generate new agents ☆405 Updated 4 months ago
- This is an MCP server that allows you to directly download transcripts of YouTube videos. ☆456 Updated last month
- A highly customizable, lightweight, and open-source coding CLI powered by Groq for instant iteration. ☆701 Updated last month
- Spawn agents anywhere in one keypress ☆129 Updated this week
- Open-source framework for developing real-time multimodal conversational AI agents. ☆587 Updated this week
- Teleport Claude Code, Codex or Gemini CLI to your phone and work anywhere ☆248 Updated this week
- Clean, LLM-optimized Reddit MCP server. Browse posts, search content, analyze users. No fluff, just Reddit data. ☆362 Updated this week
- mem-agent mcp server ☆604 Updated 2 months ago
- [DEPRECATED] Ito, smart dictation in every application ☆560 Updated 2 weeks ago
- Browser Operator - The AI browser with a built-in Multi-Agent platform! Open source alternative to ChatGPT Atlas, Perplexity Comet, Dia and… ☆423 Updated last week
- MCP Server for code graph analysis and visualization by CodeGPT ☆374 Updated 2 months ago
- 🔥 A tool to analyze your website's AI-readiness, powered by Firecrawl ☆233 Updated 4 months ago
- Use Claude Code with any LLM provider - GLM-4.5, Kimi-K2, Qwen3-Coder, DeepSeek, etc. ☆377 Updated 4 months ago
- The memory-first coding agent ☆914 Updated this week
- Lobster is a Clawdbot-native workflow shell: a typed, local-first “macro engine” that turns skills/tools into composable pipelines and sa… ☆335 Updated this week
- A Multi-modal MCP client for voice powered agentic workflows ☆209 Updated 11 months ago
- A prompt optimization system that adapts your prompts for different AI providers. ☆158 Updated last month
- ☆159 Updated last month
- An adaptive multi-agent system that extracts your literary DNA through conversation and generates actionable reading profiles. ☆168 Updated 2 months ago