sofi444 / realtime-transcription-fastrtc
Real Time Speech Transcription with FastRTC ⚡️ and Local Whisper 🤗
☆647 Updated this week
Alternatives and similar repositories for realtime-transcription-fastrtc
Users who are interested in realtime-transcription-fastrtc are comparing it to the libraries listed below.
- Fast Streaming TTS with Orpheus + WebRTC (with FastRTC) ☆274 Updated last month
- An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard though ☆544 Updated last month
- Oliva Multi-Agent Assistant ☆352 Updated last month
- ☆721 Updated 3 weeks ago
- Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation, where one waits f… ☆1,030 Updated last month
- A Fast TTS Engine ☆495 Updated 3 months ago
- Make any LLM think like OpenAI o1 and DeepSeek R1 ☆488 Updated 3 months ago
- ☆608 Updated this week
- Serverless Modal + FastAPI + React + ColPali + Qdrant + GPT4o Vision RAG (V-RAG) Demo ☆366 Updated 5 months ago
- Interface for OuteTTS models. ☆1,214 Updated 2 weeks ago
- ColiVara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st… ☆903 Updated 2 weeks ago
- A Kubernetes deployable instance of GroundX for document parsing, storage, and search. ☆709 Updated last week
- Implementation of F5-TTS in MLX ☆535 Updated last month
- An implementation of the CSM (Conversation Speech Model) for Apple Silicon using MLX. ☆335 Updated this week
- Open source conversation framework and visual editor for structured Pipecat dialogues ☆309 Updated last week
- Transform PDFs into AI podcasts for engaging on-the-go audio content. ☆640 Updated 3 weeks ago
- Multi-modal conversational AI (xRx) system ☆301 Updated 4 months ago
- Run Orpheus 3B Locally With LM Studio ☆401 Updated last month
- Whisper with Medusa heads ☆833 Updated 2 weeks ago
- 🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with PostgreSQL or SQLite ☆929 Updated this week
- Local realtime voice AI ☆2,290 Updated 2 months ago
- ☆413 Updated 5 months ago
- ☆251 Updated 7 months ago
- High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs. ☆353 Updated 3 weeks ago
- A real-time speech-to-speech chatbot powered by Whisper Small, Llama 3.2, and Kokoro-82M. ☆225 Updated 3 months ago
- ✨ AI interface for tinkerers (Ollama, Haystack RAG, Python) ☆447 Updated last month
- Open source inference code for Rev's model ☆402 Updated 3 weeks ago
- 📄 🧠 PageIndex: Document Index System for Reasoning-based RAG ☆732 Updated last week
- Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model". ☆902 Updated 6 months ago
- first base model for full-duplex conversational audio ☆1,741 Updated 4 months ago