sofi444 / realtime-transcription-fastrtc
Real Time Speech Transcription with FastRTC ⚡️ and Local Whisper 🤗
☆661 Updated last week
Alternatives and similar repositories for realtime-transcription-fastrtc
Users who are interested in realtime-transcription-fastrtc are comparing it to the repositories listed below.
- Fast Streaming TTS with Orpheus + WebRTC (with FastRTC) ☆294 Updated 2 months ago
- Oliva Multi-Agent Assistant ☆369 Updated 2 months ago
- An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard though ☆553 Updated 3 weeks ago
- ☆577 Updated this week
- Make any LLM think like OpenAI o1 and DeepSeek R1 ☆490 Updated 4 months ago
- Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits f… ☆1,125 Updated 2 months ago
- ☆754 Updated 2 months ago
- Interface for OuteTTS models. ☆1,318 Updated last week
- ☆662 Updated 3 weeks ago
- A Kubernetes deployable instance of GroundX for document parsing, storage, and search. ☆758 Updated this week
- ColiVara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st… ☆1,138 Updated last month
- Transform PDFs into AI podcasts for engaging on-the-go audio content. ☆671 Updated 3 weeks ago
- An implementation of the CSM (Conversational Speech Model) for Apple Silicon using MLX. ☆359 Updated last month
- Serverless Modal + FastAPI + React + ColPali + Qdrant + GPT4o Vision RAG (V-RAG) Demo ☆382 Updated 7 months ago
- 🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQL ☆1,021 Updated last week
- High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs. ☆440 Updated 2 months ago
- ☆488 Updated this week
- A Fast TTS Engine ☆517 Updated 5 months ago
- ☆257 Updated 8 months ago
- Implementation of F5-TTS in MLX ☆555 Updated 3 months ago
- A Chrome extension for asking questions over websites ☆341 Updated 4 months ago
- Run Orpheus 3B Locally With LM Studio ☆428 Updated 3 months ago
- Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model". ☆912 Updated 8 months ago
- ✨ AI interface for tinkerers (Ollama, Haystack RAG, Python) ☆462 Updated last week
- Whisper with Medusa heads ☆843 Updated 3 weeks ago
- Local realtime voice AI ☆2,328 Updated 3 months ago
- Open source conversation framework and visual editor for structured Pipecat dialogues ☆348 Updated last week
- When RAG and agents fall in love ☆328 Updated 6 months ago
- Connect to your customer data using any LLM and gain actionable insights. IdentityRAG creates a single comprehensive customer 360 view (g… ☆223 Updated 7 months ago
- Blazing fast whisper turbo for ASR (speech-to-text) tasks ☆211 Updated 8 months ago