Haschtl / transcripy
Multi speaker audio transcription
☆37Updated 2 years ago
Alternatives and similar repositories for transcripy:
Users that are interested in transcripy are comparing it to the libraries listed below
- Caption, translate, and optionally record in real time "what you hear" from speakers and microphone. Never miss part of the conversation …☆16Updated last year
- An open-source, browser-based transcript viewer and manager. Upload, transcribe, and chat with meeting recordings using AI. Features meet…☆39Updated 2 months ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆53Updated 4 months ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆205Updated last week
- Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM.☆218Updated 2 weeks ago
- On-device speaker diarization powered by deep learning☆43Updated last month
- An OpenAI API compatible speech to text server for audio transcription and translations, aka. Whisper.☆74Updated 2 months ago
- An Agent that makes outbound calls using SIP and Dispatch APIs☆18Updated last week
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆62Updated last year
- Transcription with speaker diarization pipeline☆92Updated last year
- 💬 Transcribe, translate, diarize, annotate and subtitle video (and audio) with Whisper on Win, Linux and Mac ... fast!☆42Updated 3 weeks ago
- Clip any moment from any video with prompts☆105Updated 3 months ago
- Yet another self-hosted AI voice assistant. GlaDOS' blazing fast pipeline with a more realistic Kokoro TTS voice and vision.☆54Updated 2 months ago
- Speaker diarization model☆26Updated 2 years ago
- Efficient approach to speaker diarization using voice characteristics extraction☆92Updated 11 months ago
- Viral Factory is a highly modular gradio app that automates the production of various forms of social media content. Thanks to it's comp…☆46Updated this week
- Transcribe a live audio-stream in near real time using OpenAI-Whisper. Monitor it for keywords and trigger alarm to Signal messenger☆28Updated 2 years ago
- Whisper from OpenAi and diarization with Pyannote☆38Updated last year
- AI_Video_Shorts_Creator is a python-based tool that uses OpenAI's GPT-4 power to automatically analyze videos, extract the most interesti…☆19Updated last year
- Podalize: Podcast Transcription and Analysis☆155Updated 7 months ago
- A full stack app for interruptible, low-latency and near-human quality AI phone calls built from stitching LLMs, speech understanding too…☆127Updated 8 months ago
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into di…☆202Updated 2 months ago
- A desktop application that transcribes audio from files, microphone input or YouTube videos with the option to translate the content and …☆196Updated 6 months ago
- 🔊😊 A fastapi voice-assistant framework to quickly prototype LLM-powered voice assistants in <5 minutes.☆27Updated last year
- Tool for automatic transcription and speaker diarization based on whisper and pyannote.☆44Updated 2 months ago
- Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and…☆62Updated 3 weeks ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆62Updated last week
- Voice cloning AI (deepfake for voice). Using cloned voice from only 5-10 seconds of targeted voice.☆62Updated 3 years ago
- Realtime tts reading of large textfiles by your favourite voice. +Translation via LLM (Python script)☆52Updated 6 months ago
- A voice to text keyboard based on OpenAI Whisper Model.☆50Updated last year