jfgonsalves / parakeet-diarized
Parakeet 0.6b V2 + Pyannote diarization behind a Whisper API
☆52 Updated last month
Alternatives and similar repositories for parakeet-diarized
Users who are interested in parakeet-diarized compare it to the libraries listed below.
- 💬 Transcribe, translate, diarize, annotate and subtitle video (and audio) with Whisper on Win, Linux and Mac ... fast! ☆71 Updated this week
- Route select OpenAI API endpoints to local services (e.g. Whisper, Ollama, Kororo). ☆22 Updated 7 months ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution. ☆53 Updated last year
- The PyVisionAI Official Repo ☆104 Updated 4 months ago
- Personal collection of pipes and filters I use for open-webui ☆19 Updated last month
- Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM. ☆278 Updated this week
- Fast Streaming TTS with Orpheus + WebRTC (with FastRTC) ☆346 Updated 8 months ago
- GoalChain for goal-orientated LLM conversation flows ☆71 Updated last year
- Transcribe audio and video files with speaker diarization and logically grouped timestamps using Gemini Flash ☆44 Updated last week
- 🗣️ Real‑time, low‑latency voice, vision, and conversational‑memory AI assistant built on LiveKit and local LLMs ✨ ☆101 Updated 5 months ago
- Generate Your Own Private Morning Radio for Commute ☆33 Updated 10 months ago
- A web application that converts speech to speech 100% private ☆81 Updated 6 months ago
- Self-hosted AI medical scribe. ☆60 Updated last week
- An AI assistant building SDK in Python ☆38 Updated 2 months ago
- ez audio transcription tool with flexible processing and post-processing options ☆160 Updated last year
- Tool for automatic transcription and speaker diarization based on whisper and pyannote. ☆62 Updated 10 months ago
- An API to transcribe audio with OpenAI's Whisper Large v3! ☆330 Updated last year
- WebUI for ScAIbe ☆49 Updated 6 months ago
- This is the backend for the entire Amurex project. ☆136 Updated 8 months ago
- An OpenAI API compatible speech to text server for audio transcription and translations, aka. Whisper. ☆89 Updated 10 months ago
- A novel media player that allows you to navigate by speaker ☆82 Updated last month
- Welcome! ☆140 Updated last year
- FastAPI + MLX offline-first voice agent with <1s latency. Minimal UI ☆39 Updated last month
- Local, OpenAI-compatible text-to-speech (TTS) API using Chatterbox, enabling users to generate voice cloned speech anywhere the OpenAI AP… ☆416 Updated last month
- kokoro text to speech using javascript ☆63 Updated 10 months ago
- RocketRAG is a high-performance Retrieval-Augmented Generation (RAG) system designed with a focus on speed, simplicity, and extensibility… ☆75 Updated 3 months ago
- An open-source agent toolkit that auto-syncs SDK versions, docs, and examples—built for seamless integration with LLMs, and AI agents ( M… ☆43 Updated 4 months ago
- Your personal and private AI ☆52 Updated 8 months ago
- Realtime TTS reading of large text files by your favourite voice. +Translation via LLM (Python script) ☆52 Updated last year
- WebRAgent is a retrieval-augmented generation (RAG) web application featuring agent-based query decomposition, vector search with Qdrant,… ☆53 Updated 8 months ago