dimastatz / whisper-flowLinks
Whisper-Flow is a framework designed to enable real-time transcription of audio content using OpenAI’s Whisper model. Rather than processing entire files after upload (“batch mode”), Whisper-Flow accepts a continuous stream of audio chunks and produces incremental transcripts immediately.
☆344Updated 9 months ago
Alternatives and similar repositories for whisper-flow
Users that are interested in whisper-flow are comparing it to the libraries listed below
Sorting:
- Local Groq Desktop chat app with MCP support☆372Updated last week
- Voice Powered Agent Delegation☆94Updated last week
- ☆186Updated 7 months ago
- Make your meetings accessible to AI Agents☆402Updated 2 weeks ago
- MCP server for enabling LLM applications to perform deep research via the MCP protocol☆284Updated 3 weeks ago
- Model Context Protocol server for Replicate's API☆88Updated 2 months ago
- A Multi-modal MCP client for voice powered agentic workflows☆207Updated 9 months ago
- MCP server retrieving transcripts of YouTube videos☆133Updated 2 weeks ago
- Giving Claude ability to run code with E2B via MCP (Model Context Protocol)☆349Updated 3 weeks ago
- next-generation AI memory infrastructure (powered by mem0 and graphiti)☆163Updated 3 months ago
- Pipecat voice AI agents running locally on macOS☆295Updated 3 months ago
- A Model Context Protocol (MCP) server for ATLAS, a Neo4j-powered task management system for LLM Agents - implementing a three-tier archit…☆275Updated 4 months ago
- Model Context Protocol (MCP) Server for Graphlit Platform☆369Updated this week
- The agentic video editing framework☆179Updated 9 months ago
- Open source conversation framework for structured Pipecat dialogues☆486Updated last week
- A Model Context Protocol (MCP) server for research and documentation assistance using Perplexity AI. Won 1st @ Cline Hackathon☆269Updated 3 weeks ago
- MCP Server to Use HuggingFace spaces, easy configuration and Claude Desktop mode.☆368Updated 5 months ago
- Chat Application Starter Kit — Gemini Multimodal Live API + Pipecat☆221Updated last month
- MCP server that execute applescript giving you full control of your Mac☆378Updated last week
- ☆156Updated last month
- The only general AI agent that does NOT requires extra API key, giving you full control on your local and remote MacOs from Claude Deskto…☆416Updated 5 months ago
- Voice AI agent starter kit with Groq, Llama 4, and (optionally) Twilio☆72Updated 2 months ago
- A Model Context Protocol (MCP) server that helps read GitHub repository structure and important files.☆289Updated 10 months ago
- A Model-Context Protocol Server for YouTube☆470Updated 8 months ago
- Real-Time Voice Inference Web SDK☆291Updated last week
- This is an MCP server that allows you to directly download transcripts of YouTube videos.☆345Updated last month
- Surf is a computer use AI agent powered by OpenAI that interacts with a E2B's virtual desktop environment through natural language instru…☆641Updated last month
- ☆74Updated 5 months ago
- MCP server for browser-use☆78Updated 8 months ago
- Open-source framework for developing real-time multimodal conversational AI agents.☆531Updated last week