dimastatz / whisper-flowLinks
Whisper-Flow is a framework designed to enable real-time transcription of audio content using OpenAI’s Whisper model. Rather than processing entire files after upload (“batch mode”), Whisper-Flow accepts a continuous stream of audio chunks and produces incremental transcripts immediately.
☆474Updated 10 months ago
Alternatives and similar repositories for whisper-flow
Users that are interested in whisper-flow are comparing it to the libraries listed below
Sorting:
- Open-source framework for developing real-time multimodal conversational AI agents.☆553Updated this week
- ☆383Updated 3 weeks ago
- The only general AI agent that does NOT requires extra API key, giving you full control on your local and remote MacOs from Claude Deskto…☆437Updated 7 months ago
- Make your meetings accessible to AI Agents☆418Updated last month
- AI writing agent powered by kimi-k2-thinking - autonomously creates novels and stories with deep reasoning☆515Updated 2 months ago
- MCP server retrieving transcripts of YouTube videos☆269Updated last week
- AI agents platform that gives you a workspace with an integrated team of personal assistants that can work behind the scenes to handle da…☆192Updated 5 months ago
- Local Groq Desktop chat app with MCP support☆381Updated this week
- AI writing agent powered by gemini 3 flash - autonomously creates novels and stories with deep reasoning☆236Updated 2 weeks ago
- Spawn agents anywhere in one keypress☆126Updated this week
- An OS for your agents, built for your pocket.☆787Updated 2 months ago
- Ito, smart dictation in every application☆553Updated 3 weeks ago
- mem-agent mcp server☆599Updated last month
- Press shortcut → speak → get text. Free and open source ❤️☆223Updated 3 months ago
- Browser Operator - The AI browser with built in Multi-Agent platform! Open source alternative to ChatGPT Atlas, Perplexity Comet, Dia and…☆385Updated this week
- next-generation AI memory infrastructure (powered by mem0 and graphiti)☆164Updated last month
- A highly customizable, lightweight, and open-source coding CLI powered by Groq for instant iteration.☆694Updated 3 weeks ago
- An agent that uses OpenAI's Agents SDK to generate new agents☆403Updated 3 months ago
- 🔥 Visual AI research assistant that displays real-time thinking, provides split-view analysis, and automatic citations using Claude and …☆441Updated 6 months ago
- Components, hooks and template apps for building React voice AI applications quickly. Designed to support and accelerate Pipecat AI devel…☆204Updated this week
- gpt-oss + voice-ui-kit experiment☆152Updated 5 months ago
- Pipecat voice AI agents running locally on macOS☆300Updated 4 months ago
- A prompt optimization system that adapts your prompts for different AI providers.☆153Updated last month
- An adaptive multi-agent system that extracts your literary DNA through conversation and generates actionable reading profiles.☆162Updated 2 months ago
- Make text LLMs listen and speak☆1,058Updated 2 weeks ago
- ☆189Updated 9 months ago
- A Model Context Protocol (MCP) server that bridges Video & Audio content with Large Language Models using yt-dlp.☆183Updated this week
- PocketFlow's node-based workflow structure, with Manus' agents and tools!☆288Updated 2 months ago
- Open source conversation framework for structured Pipecat dialogues☆519Updated 2 weeks ago
- ☆47Updated last year