Optimized Whisper models for streaming and on-device use
☆888May 14, 2026Updated last week
Alternatives and similar repositories for TheWhisper
Users that are interested in TheWhisper are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆10Feb 14, 2025Updated last year
- 🔊Replicate Cog'ified MMAudio🎵☆18Jul 10, 2025Updated 10 months ago
- Data recipes and robust infrastructure for training AI agents☆144May 14, 2026Updated last week
- Patient Intake Form Extraction using llm☆15May 29, 2025Updated 11 months ago
- Implementation of a fast semantic chunker in C++, installable in python 3.7+ projects.☆22Sep 20, 2025Updated 8 months ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- VI-SVC model is just VITS without MAS and DurationPredictor.☆10Nov 9, 2023Updated 2 years ago
- Remix + Preact for SSR, Preact + Islands + On Demand Compilation for client side interactions.☆12Sep 24, 2022Updated 3 years ago
- An open-source implementation of Whisper☆490Oct 29, 2025Updated 6 months ago
- ☆21Feb 14, 2026Updated 3 months ago
- Enemies for your LLM☆36Jan 20, 2026Updated 4 months ago
- Create a QnA bot on a pdf☆16May 27, 2023Updated 2 years ago
- A minimal Python library for live coding visual scenes using desktop windows.☆71Mar 14, 2026Updated 2 months ago
- Video chat with Modal's mascots, Moe and Dal, about Modal and its documentation.☆62Apr 8, 2026Updated last month
- ☆40Oct 5, 2025Updated 7 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion☆10Sep 22, 2024Updated last year
- DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors☆38Feb 11, 2025Updated last year
- ☆32Jul 25, 2025Updated 9 months ago
- ☆198Jul 22, 2025Updated 9 months ago
- ☆19Feb 23, 2026Updated 2 months ago
- On-device TTS model by Neuphonic☆5,877Apr 24, 2026Updated 3 weeks ago
- ☆64Jan 7, 2026Updated 4 months ago
- Llama-Mimi is a speech language model that uses a unified tokenizer (Mimi) and a single Transformer decoder (Llama) to jointly model sequ…☆31Sep 20, 2025Updated 8 months ago
- Challenges in Video-Based Infant Action Recognition: A Critical Examination of the State of the Art (WACVW'24)☆16Nov 20, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Disposable, ephemeral network infrastructure powered by GitHub Codespaces.☆115May 13, 2026Updated last week
- Letting agents safely operate on your local file system in browser with no backend.☆53Dec 11, 2025Updated 5 months ago
- [CVPR 2026] 🔥🔥 Official Repo of USO: Unified Style and Subject-Driven Generation via Disentangled and Reward Learning☆1,222Sep 12, 2025Updated 8 months ago
- Universal MCP server installer - install any MCP server to any AI agent with one command☆19Feb 14, 2026Updated 3 months ago
- ☆16Feb 19, 2026Updated 3 months ago
- 🔌 Plug-and-play library to enable agents to call MCP and UTCP tools via code execution.☆1,459May 3, 2026Updated 2 weeks ago
- Pal is a Personal Agent that Learns how you work by building a compounding knowledge base.☆257Apr 28, 2026Updated 3 weeks ago
- ConMamba for Automatic Speech Recognition☆104Aug 12, 2024Updated last year
- Orca is a workspace for vibe coding built upon the principals of tracking what the agent changes and only keeping what you want☆61May 9, 2026Updated last week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- VoiceBox neural network implementation☆110Aug 2, 2024Updated last year
- Fast Streaming TTS with Orpheus + WebRTC (with FastRTC)☆354Apr 10, 2025Updated last year
- Open-source framework for developing real-time multimodal conversational AI agents.☆627Updated this week
- A toolkit of modern dotnet new templates for C# 14, .NET 10, Microsoft Orleans 10, Windows App SDK and Uno Platform 6☆17Apr 20, 2026Updated last month
- ☆26Oct 15, 2024Updated last year
- A very simple starter template to get started exploring Microsoft Orleans☆11Jul 9, 2025Updated 10 months ago
- Orpheus TTS Server with streaming support (TTFB ~160ms)☆26Sep 21, 2025Updated 8 months ago