Optimized Whisper models for streaming and on-device use
☆821Mar 1, 2026Updated 2 weeks ago
Alternatives and similar repositories for TheWhisper
Users that are interested in TheWhisper are comparing it to the libraries listed below
Sorting:
- ☆10Feb 14, 2025Updated last year
- Data recipes and robust infrastructure for training AI agents☆110Updated this week
- 🔊Replicate Cog'ified MMAudio🎵☆18Jul 10, 2025Updated 8 months ago
- Track and manage your recurring subscriptions☆31Nov 9, 2025Updated 4 months ago
- Patient Intake Form Extraction using llm☆15May 29, 2025Updated 9 months ago
- Implementation of a fast semantic chunker in C++, installable in python 3.7+ projects.☆22Sep 20, 2025Updated 6 months ago
- VI-SVC model is just VITS without MAS and DurationPredictor.☆10Nov 9, 2023Updated 2 years ago
- ☆25Oct 25, 2025Updated 4 months ago
- An open-source implementation of Whisper☆479Oct 29, 2025Updated 4 months ago
- Enemies for your LLM☆35Jan 20, 2026Updated 2 months ago
- Create a QnA bot on a pdf☆16May 27, 2023Updated 2 years ago
- A minimal Python library for live coding visual scenes using desktop windows.☆71Updated this week
- ☆18Feb 23, 2026Updated 3 weeks ago
- Video chat with Modal's mascots, Moe and Dal, about Modal and its documentation.☆58Mar 1, 2026Updated 2 weeks ago
- Identity preservation protocol for AI agents. Build walls around who you are.☆36Mar 8, 2026Updated last week
- Summon it with a keystroke, throw in anything you want to remember or ask your own memory. A local LLM agent that restructures your knowl…☆109Updated this week
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion☆10Sep 22, 2024Updated last year
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆21Jun 7, 2025Updated 9 months ago
- TTS model capable of streaming conversational audio in realtime.☆1,097Nov 29, 2025Updated 3 months ago
- On-device TTS model by Neuphonic☆5,012Mar 11, 2026Updated last week
- ComfyUI unofficial implementation of Thera - Aliasing-Free Arbitrary-Scale Super-Resolution with Neural Heat Fields☆36Jan 2, 2026Updated 2 months ago
- Template app using Cloudflare Workers, Hono, and Replicate to generate images using Flux Schnell☆17Feb 13, 2025Updated last year
- ☆10May 31, 2023Updated 2 years ago
- a lightweight data pipeline framework for running audio or binary processing jobs at small scale using django and RQ☆50Jan 11, 2026Updated 2 months ago
- Challenges in Video-Based Infant Action Recognition: A Critical Examination of the State of the Art (WACVW'24)☆17Nov 20, 2023Updated 2 years ago
- Letting agents safely operate on your local file system in browser with no backend.☆49Dec 11, 2025Updated 3 months ago
- [CVPR 2026] 🔥🔥 Official Repo of USO: Unified Style and Subject-Driven Generation via Disentangled and Reward Learning☆1,213Sep 12, 2025Updated 6 months ago
- ☆76Feb 18, 2026Updated last month
- 🔌 Plug-and-play library to enable agents to call MCP and UTCP tools via code execution.☆1,392Feb 8, 2026Updated last month
- ☆16Feb 19, 2026Updated last month
- Universal MCP server installer - install any MCP server to any AI agent with one command☆17Feb 14, 2026Updated last month
- Welcome!☆140Dec 13, 2024Updated last year
- Example scripts for vmux - run any command in the cloud☆41Feb 1, 2026Updated last month
- Orpheus TTS Server with streaming support (TTFB ~160ms)☆24Sep 21, 2025Updated 5 months ago
- Orca is a workspace for vibe coding built upon the principals of tracking what the agent changes and only keeping what you want☆50Mar 13, 2026Updated last week
- Fast Streaming TTS with Orpheus + WebRTC (with FastRTC)☆350Apr 10, 2025Updated 11 months ago
- VoiceBox neural network implementation☆110Aug 2, 2024Updated last year
- ☆26Oct 15, 2024Updated last year
- Claude Code agent that routes to external LLMs (Grok, Gemini, GPT-5, etc.) via OpenRouter - just mention the model name☆29Nov 30, 2025Updated 3 months ago