sofdog-gh/realtime-transcription-fastrtc

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sofdog-gh/realtime-transcription-fastrtc)

sofdog-gh / realtime-transcription-fastrtc

Real Time Speech Transcription with FastRTC ⚡️and Local Whisper 🤗

☆701

Alternatives and similar repositories for realtime-transcription-fastrtc

Users that are interested in realtime-transcription-fastrtc are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

gradio-app / fastrtc
View on GitHub
The python library for real-time communication
☆4,616Jan 12, 2026Updated 6 months ago
freddyaboulton / orpheus-cpp
View on GitHub
Fast Streaming TTS with Orpheus + WebRTC (with FastRTC)
☆353Apr 10, 2025Updated last year
canopyai / Orpheus-TTS
View on GitHub
Towards Human-Sounding Speech
☆6,258Dec 5, 2025Updated 7 months ago
Vaibhavs10 / llama-assistant
View on GitHub
☆171Aug 16, 2024Updated last year
Deluxer / oliva
View on GitHub
Oliva Multi-Agent Assistant
☆385Apr 11, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
janhq / ichigo
View on GitHub
Local realtime voice AI
☆2,490Nov 26, 2025Updated 7 months ago
edwko / OuteTTS
View on GitHub
Interface for OuteTTS models.
☆1,436Mar 23, 2026Updated 4 months ago
ufal / whisper_streaming
View on GitHub
Whisper realtime streaming for long speech-to-text transcription and translation
☆3,653Nov 12, 2025Updated 8 months ago
huggingface / speech-to-speech
View on GitHub
Build local voice agents with open-source models
☆6,309Updated this week
wassim249 / YT-Navigator
View on GitHub
YT Navigator: AI-powered YouTube content explorer that lets you search and chat with channel videos using AI agents. Extract insights fro…
☆601Mar 27, 2025Updated last year
huggingface / ember
View on GitHub
ANE accelerated embedding models!
☆20Dec 11, 2024Updated last year
kyutai-labs / hibiki
View on GitHub
Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits f…
☆1,488Apr 15, 2025Updated last year
philschmid / gemini-samples
View on GitHub
☆1,368Mar 3, 2026Updated 4 months ago
kyutai-labs / moshi
View on GitHub
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi…
☆10,686May 16, 2026Updated 2 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
mrorigo / agentic-deep-graph-reasoning
View on GitHub
Agentic Deep Graph Reasoning Implementation
☆14Mar 4, 2025Updated last year
SesameAILabs / csm
View on GitHub
A Conversational Speech Generation Model
☆14,696May 27, 2025Updated last year
fixie-ai / ultravox
View on GitHub
A fast multimodal LLM for real-time voice
☆4,479Dec 12, 2025Updated 7 months ago
Vaibhavs10 / insanely-fast-whisper
View on GitHub
☆12,995Oct 25, 2025Updated 9 months ago
pipecat-ai / pipecat
View on GitHub
Open Source framework for voice and multimodal conversational AI
☆13,687Updated this week
resemble-ai / chatterbox
View on GitHub
SoTA open-source TTS
☆25,674Updated this week
m-bain / whisperX
View on GitHub
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
☆23,229Jul 13, 2026Updated last week
SouthBridgeAI / llm-transcription-study
View on GitHub
Useful resources for LLM-based Diarization and Transcription.
☆55Oct 15, 2024Updated last year
SouthBridgeAI / offmute
View on GitHub
An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard though
☆568Apr 8, 2026Updated 3 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
pipecat-ai / pipecat-client-web
View on GitHub
Real-Time Voice Inference Web SDK
☆320Jul 17, 2026Updated last week
MinishLab / model2vec
View on GitHub
Fast State-of-the-Art Static Embeddings
☆2,166Jun 6, 2026Updated last month
huggingface / distil-whisper
View on GitHub
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
☆4,097Jan 8, 2025Updated last year
collabora / WhisperLive
View on GitHub
A nearly-live implementation of OpenAI's Whisper.
☆4,153Jul 17, 2026Updated last week
SYSTRAN / faster-whisper
View on GitHub
Faster Whisper transcription with CTranslate2
☆24,503Nov 19, 2025Updated 8 months ago
Lightning-AI / LitServe
View on GitHub
A minimal Python framework for building custom AI inference servers with full control over logic, batching, and scaling.
☆3,919Updated this week
argilla-io / synthetic-data-generator
View on GitHub
Build datasets using natural language
☆587Sep 19, 2025Updated 10 months ago
huggingface / parler-tts
View on GitHub
Inference and training library for high-quality TTS models.
☆5,581Dec 10, 2024Updated last year
aiola-lab / whisper-medusa
View on GitHub
Whisper with Medusa heads
☆860Jul 2, 2026Updated 3 weeks ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
huggingface / huggingface-gemma-recipes
View on GitHub
Inference, Fine Tuning and many more recipes with Gemma family of models
☆304Apr 2, 2026Updated 3 months ago
ysharma3501 / FlashSR
View on GitHub
Fast audio super resolution from 16khz to 48khz.
☆215Jan 3, 2026Updated 6 months ago
AK391 / ai-gradio
View on GitHub
A Python package that makes it easy for developers to create AI apps powered by various AI providers.
☆1,642Apr 8, 2025Updated last year
souzatharsis / podcastfy
View on GitHub
An Open Source Python alternative to NotebookLM's podcast feature: Transforming Multimodal Content into Captivating Multilingual Audio Co…
☆6,455May 4, 2026Updated 2 months ago
juanmc2005 / diart
View on GitHub
A python package to build AI-powered real-time audio applications
☆2,005Jun 19, 2026Updated last month
pipecat-ai / smart-turn
View on GitHub
☆1,483Jan 29, 2026Updated 5 months ago
agno-agi / agno
View on GitHub
Build, run, and manage agent platforms.
☆41,409Updated this week