sofi444 / realtime-transcription-fastrtc

Real Time Speech Transcription with FastRTC ⚡️and Local Whisper 🤗

☆683

Alternatives and similar repositories for realtime-transcription-fastrtc

Users who are interested in realtime-transcription-fastrtc are comparing it to the libraries listed below.

freddyaboulton / orpheus-cpp
Fast Streaming TTS with Orpheus + WebRTC (with FastRTC)
☆309 Updated 6 months ago
kyutai-labs / unmute
Make text LLMs listen and speak
☆904 Updated last week
fluxions-ai / vui
☆634 Updated 2 months ago
pipecat-ai / smart-turn
☆961 Updated 3 weeks ago
kyutai-labs / hibiki
Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits f…
☆1,291 Updated 5 months ago
SouthBridgeAI / offmute
An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard though
☆556 Updated 4 months ago
superlinear-ai / raglite
🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQL
☆1,086 Updated last week
Deluxer / oliva
Oliva Multi-Agent Assistant
☆382 Updated 5 months ago
Softlandia-Ltd / vision-is-all-you-need
Serverless Modal + FastAPI + React + ColPali + Qdrant + GPT4o Vision RAG (V-RAG) Demo
☆394 Updated 3 months ago
tjmlabs / ColiVara
Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st…
☆1,263 Updated 5 months ago
harishsg993010 / LLM-Reasoner
Make any LLM think like OpenAI o1 and DeepSeek R1
☆489 Updated 8 months ago
lucasnewman / f5-tts-mlx
Implementation of F5-TTS in MLX
☆586 Updated 6 months ago
pipecat-ai / pipecat-flows
Open source conversation framework and visual editor for structured Pipecat dialogues
☆457 Updated last week
astramind-ai / Auralis
A Fast TTS Engine
☆549 Updated 8 months ago
senstella / csm-mlx
An implementation of the CSM (Conversation Speech Model) for Apple Silicon using MLX.
☆378 Updated last month
edwko / OuteTTS
Interface for OuteTTS models.
☆1,384 Updated 3 months ago
dsa / fast-voice-assistant
⚡ Insanely fast AI voice assistant with <500ms response times
☆574 Updated 10 months ago
NVIDIA-AI-Blueprints / pdf-to-podcast
Transform PDFs into AI podcasts for engaging on-the-go audio content.
☆749 Updated 4 months ago
aiola-lab / whisper-medusa
Whisper with Medusa heads
☆859 Updated 2 months ago
eyelevelai / groundx-on-prem
A Kubernetes deployable instance of GroundX for document parsing, storage, and search.
☆798 Updated this week
Mega4alik / ollm
☆1,743 Updated this week
multinear-demo / demo-bank-support-lc-py
☆417 Updated 10 months ago
senstella / parakeet-mlx
An implementation of Nvidia's Parakeet models for Apple Silicon using MLX.
☆504 Updated last week
kyutai-labs / delayed-streams-modeling
Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.
☆2,413 Updated 2 weeks ago
satvik314 / opensource_notebooklm
An opensource implementation of NotebookLM using Deepseek-V3 and PlayHT TTS.
☆282 Updated 9 months ago
farshed / sage
Self-hosted voice chat with LLMs
☆462 Updated 7 months ago
johnmai-dev / NotebookMLX
📋 NotebookMLX - An Open Source version of NotebookLM (Ported NotebookLlama)
☆316 Updated 7 months ago
asiff00 / On-Device-Speech-to-Speech-Conversational-AI
This is an on-CPU real-time conversational system for two-way speech communication with AI models, utilizing a continuous streaming archi…
☆195 Updated 5 months ago
amanvirparhar / weebo
A real-time speech-to-speech chatbot powered by Whisper Small, Llama 3.2, and Kokoro-82M.
☆240 Updated 8 months ago
menloresearch / ichigo
Local realtime voice AI
☆2,368 Updated 7 months ago