dimastatz / whisper-flowLinks
Whisper-Flow is a framework designed to enable real-time transcription of audio content using OpenAI’s Whisper model. Rather than processing entire files after upload (“batch mode”), Whisper-Flow accepts a continuous stream of audio chunks and produces incremental transcripts immediately.
☆332Updated 8 months ago
Alternatives and similar repositories for whisper-flow
Users that are interested in whisper-flow are comparing it to the libraries listed below
Sorting:
- Local Groq Desktop chat app with MCP support☆364Updated this week
- Open source conversation framework and visual editor for structured Pipecat dialogues☆473Updated last week
- Engineer your reusable, customizable, prompt library in Marimo reactive notebooks☆224Updated last year
- ☆184Updated 7 months ago
- A Multi-modal MCP client for voice powered agentic workflows☆204Updated 9 months ago
- The agentic video editing framework☆174Updated 8 months ago
- Daily Bots Web Demo showcasing how to build real-time voice AI agents☆246Updated last month
- AI agents platform that gives you a workspace with an integrated team of personal assistants that can work behind the scenes to handle da…☆190Updated 3 months ago
- List of curated use cases built using Sesame's CSM 1B☆73Updated 5 months ago
- Chat Application Starter Kit — Gemini Multimodal Live API + Pipecat☆221Updated 3 weeks ago
- Giving Claude ability to run code with E2B via MCP (Model Context Protocol)☆339Updated last week
- ☆154Updated 2 weeks ago
- MCP Server built for use with VS Code / Cline / Anthropic - enable google search and ability to follow links and research websites☆156Updated 5 months ago
- A Model Context Protocol (MCP) server for research and documentation assistance using Perplexity AI. Won 1st @ Cline Hackathon☆265Updated this week
- Voice Powered Agent Delegation☆90Updated this week
- 🔥 Visual AI research assistant that displays real-time thinking, provides split-view analysis, and automatic citations using Claude and …☆289Updated 4 months ago
- Real-Time Voice Inference Web SDK☆288Updated 2 weeks ago
- MCP Server to Use HuggingFace spaces, easy configuration and Claude Desktop mode.☆365Updated 4 months ago
- ☆72Updated 4 months ago
- Voice AI agent starter kit with Groq, Llama 4, and (optionally) Twilio☆72Updated last month
- An agent that uses OpenAI's Agents SDK to generate new agents☆396Updated last month
- The only general AI agent that does NOT requires extra API key, giving you full control on your local and remote MacOs from Claude Deskto…☆406Updated 4 months ago
- ☆220Updated 9 months ago
- Proposal for a flexible, tool-agnostic, codebase context system that helps teach AI coding tools about your codebase. Super easy to get …☆134Updated 6 months ago
- The AI Podcast Studio: generate podcasts scripts and their audio version with a team of AI workers in a Podcast Studio 🎙️📜☆208Updated 7 months ago
- VoiceMode MCP brings natural conversations to Claude Code☆423Updated last week
- next-generation AI memory infrastructure (powered by mem0 and graphiti)☆160Updated 2 months ago
- A Model Context Protocol (MCP) server for ATLAS, a Neo4j-powered task management system for LLM Agents - implementing a three-tier archit…☆268Updated 3 months ago
- MCP server that execute applescript giving you full control of your Mac☆375Updated 5 months ago
- Pipecat voice AI agents running locally on macOS☆291Updated 2 months ago