TheStageAI / TheWhisperLinks
Optimized Whisper models for streaming and on-device use
☆482Updated last week
Alternatives and similar repositories for TheWhisper
Users that are interested in TheWhisper are comparing it to the libraries listed below
Sorting:
- An open source web crawler that searches the internet☆236Updated 2 months ago
- ☆426Updated 2 weeks ago
- Make text LLMs listen and speak☆949Updated this week
- A powerful Python library for creating and managing isolated desktop environments using Docker containers.☆427Updated 2 months ago
- Frontend Repository for Elysia☆157Updated this week
- Anemoi: A Semi-Centralized Multi-agent Systems Based on Agent-to-Agent Communication MCP server from Coral Protocol☆367Updated 2 months ago
- ☆693Updated last month
- Agent File (.af): An open file format for serializing stateful AI agents with persistent memory and behavior. Share, checkpoint, and vers…☆950Updated last week
- A multi-agent LLM system for detecting and resolving cognitive dissonance.☆268Updated 3 weeks ago
- Python package and backend for the Elysia platform app.☆1,785Updated last week
- An open-source implementation of Whisper☆454Updated last week
- ComfyDeployed☆424Updated last month
- VLLM Port of the Chatterbox TTS model☆325Updated 3 weeks ago
- Inference, Fine Tuning and many more recipes with Gemma family of models☆274Updated 3 months ago
- Open-source framework for developing real-time multimodal conversational AI agents.☆491Updated last week
- mem-agent mcp server☆554Updated last month
- xpander.ai is the runtime and control plane to build, run, and ship reliable AI agents fast and anywhere☆767Updated 3 weeks ago
- Add long-term memory to any AI in minutes. Self-hosted, open, and framework-free.☆1,376Updated this week
- RAGLight is a modular framework for Retrieval-Augmented Generation (RAG). It makes it easy to plug in different LLMs, embeddings, and vec…☆605Updated last week
- This repo contains fully functional Hyperbrowser powered web apps☆236Updated 2 weeks ago
- Build AI applications that can see, hear, and speak using your screens, microphones, and cameras as inputs.☆1,018Updated this week
- Fast Streaming TTS with Orpheus + WebRTC (with FastRTC)☆340Updated 6 months ago
- The Supabase of AI era. A modular, open-source backend for building AI-native software — designed for knowledge, not static data.☆414Updated 5 months ago
- A Python framework that emulates Grok Heavy functionality using intelligent multi-agent orchestration. Deploy 4 (or more) specialized AI …☆1,026Updated 3 months ago
- ☆634Updated 3 months ago
- ☆156Updated 2 weeks ago
- gpt-oss + voice-ui-kit experiment☆150Updated 3 months ago
- Semantic search and document parsing tools for the command line☆1,403Updated last month
- Open-Source Memory Engine for LLMs, AI Agents & Multi-Agent Systems☆1,487Updated last week
- ☆141Updated last month