sofdog-gh / realtime-transcription-fastrtc
Real Time Speech Transcription with FastRTC ⚡️ and Local Whisper 🤗
☆691 Updated 5 months ago
Alternatives and similar repositories for realtime-transcription-fastrtc
Users who are interested in realtime-transcription-fastrtc are comparing it to the repositories listed below
- Fast Streaming TTS with Orpheus + WebRTC (with FastRTC) ☆346 Updated 8 months ago
- ☆635 Updated last month
- Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits f… ☆1,343 Updated 8 months ago
- Make text LLMs listen and speak ☆1,028 Updated last week
- ☆1,151 Updated 2 weeks ago
- Oliva Multi-Agent Assistant ☆388 Updated 8 months ago
- An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard though ☆563 Updated 3 weeks ago
- Serverless Modal + FastAPI + React + ColPali + Qdrant + GPT4o Vision RAG (V-RAG) Demo ☆402 Updated 5 months ago
- Make any LLM think like OpenAI o1 and DeepSeek R1 ☆492 Updated 10 months ago
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st… ☆1,404 Updated 7 months ago
- Transform PDFs into AI podcasts for engaging on-the-go audio content. ☆774 Updated 6 months ago
- ✨ AI interface for tinkerers (Ollama, Haystack RAG, Python) ☆473 Updated 3 months ago
- 🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQL ☆1,124 Updated this week
- ☆531 Updated 2 months ago
- Whisper with Medusa heads ☆864 Updated 4 months ago
- A Fast TTS Engine ☆599 Updated 10 months ago
- Examples, end-2-end tutorials and apps built using Liquid AI Foundational Models (LFM) and the LEAP SDK ☆548 Updated 2 weeks ago
- A Kubernetes deployable instance of GroundX for document parsing, storage, and search. ☆801 Updated this week
- Implementation of F5-TTS in MLX ☆603 Updated 8 months ago
- Open source conversation framework for structured Pipecat dialogues ☆499 Updated last week
- Local realtime voice AI ☆2,386 Updated 3 weeks ago
- ☆1,347 Updated 8 months ago
- Interface for OuteTTS models. ☆1,414 Updated 5 months ago
- An implementation of the CSM (Conversation Speech Model) for Apple Silicon using MLX. ☆388 Updated 4 months ago
- ☆209 Updated 10 months ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection ☆879 Updated 6 months ago
- ☆318 Updated 11 months ago
- This is an on-CPU real-time conversational system for two-way speech communication with AI models, utilizing a continuous streaming archi… ☆215 Updated 3 weeks ago
- Building Blocks for Multi-Modal Gradio Powered by Groq Apps ☆115 Updated last year
- Omnilingual ASR Open-Source Multilingual Speech Recognition for 1600+ Languages ☆2,467 Updated this week