Real Time Speech Transcription with FastRTC ⚡️and Local Whisper 🤗
☆704Jul 10, 2025Updated 10 months ago
Alternatives and similar repositories for realtime-transcription-fastrtc
Users that are interested in realtime-transcription-fastrtc are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The python library for real-time communication☆4,584Jan 12, 2026Updated 4 months ago
- Fast Streaming TTS with Orpheus + WebRTC (with FastRTC)☆354Apr 10, 2025Updated last year
- Towards Human-Sounding Speech☆6,148Dec 5, 2025Updated 5 months ago
- Oliva Multi-Agent Assistant☆387Apr 11, 2025Updated last year
- ☆170Aug 16, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Local realtime voice AI☆2,484Nov 26, 2025Updated 5 months ago
- Build local voice agents with open-source models☆4,755Updated this week
- Interface for OuteTTS models.☆1,431Mar 23, 2026Updated last month
- Whisper realtime streaming for long speech-to-text transcription and translation☆3,621Nov 12, 2025Updated 6 months ago
- YT Navigator: AI-powered YouTube content explorer that lets you search and chat with channel videos using AI agents. Extract insights fro…☆593Mar 27, 2025Updated last year
- ☆1,366Mar 3, 2026Updated 2 months ago
- ☆12,887Oct 25, 2025Updated 6 months ago
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi…☆10,211May 5, 2026Updated 2 weeks ago
- A Conversational Speech Generation Model☆14,627May 27, 2025Updated 11 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Open Source framework for voice and multimodal conversational AI☆12,260Updated this week
- A fast multimodal LLM for real-time voice☆4,424Dec 12, 2025Updated 5 months ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆21,896Apr 4, 2026Updated last month
- Official inference framework for 1-bit LLMs☆39,062Mar 10, 2026Updated 2 months ago
- Faster Whisper transcription with CTranslate2☆23,039Nov 19, 2025Updated 6 months ago
- An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard though☆565Apr 8, 2026Updated last month
- Real-Time Voice Inference Web SDK☆313May 15, 2026Updated last week
- Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits f…☆1,459Apr 15, 2025Updated last year
- Useful resources for LLM-based Diarization and Transcription.☆55Oct 15, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Fast State-of-the-Art Static Embeddings☆2,071May 6, 2026Updated 2 weeks ago
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.☆4,083Jan 8, 2025Updated last year
- A nearly-live implementation of OpenAI's Whisper.☆4,031May 15, 2026Updated last week
- Demo of knowledge graph creation and Graph RAG with Dspy and Kuzu☆22Jun 30, 2025Updated 10 months ago
- ☆1,380Jan 29, 2026Updated 3 months ago
- A minimal Python framework for building custom AI inference servers with full control over logic, batching, and scaling.☆3,882May 4, 2026Updated 2 weeks ago
- Build, run, and manage agent platforms.☆40,135May 15, 2026Updated last week
- Inference and training library for high-quality TTS models.☆5,575Dec 10, 2024Updated last year
- Get your documents ready for gen AI☆59,909Updated this week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Build datasets using natural language☆575Sep 19, 2025Updated 8 months ago
- A Python package that makes it easy for developers to create AI apps powered by various AI providers.☆1,644Apr 8, 2025Updated last year
- Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.☆64,485Updated this week
- ☆33Nov 21, 2025Updated 6 months ago
- Whisper with Medusa heads☆862Aug 6, 2025Updated 9 months ago
- AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation☆4,767May 14, 2026Updated last week
- [EMNLP Main '25] LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation☆152May 18, 2025Updated last year