tsmdt / whisply
π¬ Transcribe, translate, diarize, annotate and subtitle video (and audio) with Whisper on Win, Linux and Mac ... fast!
β41Updated 2 weeks ago
Alternatives and similar repositories for whisply:
Users that are interested in whisply are comparing it to the libraries listed below
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.β53Updated 4 months ago
- Tool for automatic transcription and speaker diarization based on whisper and pyannote.β44Updated 2 months ago
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.β54Updated 8 months ago
- ez audio transcription tool with flexible processing and post-processing optionsβ149Updated last year
- Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, andβ¦β62Updated 2 weeks ago
- Web EPUB and PDF text to speech document reader. Read documents in realtime with high-quality TTS; or extract audiobooks. Use your own Koβ¦β118Updated 2 weeks ago
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.β105Updated 2 months ago
- Transcription and annotation interface for recorded audio or video filesβ33Updated this week
- Create text chunks which end at natural stopping points without using a tokenizerβ26Updated last month
- Self-hosted Ollama + Whisper powered AI medical scribe.β23Updated this week
- A browser interface based on the Gradio library for OpenAI's Whisper model.β40Updated last year
- Public repository with all of our apps.β73Updated last week
- WebUI for ScAIbeβ37Updated 3 months ago
- A CLI speech recognition tool, using OpenAI Whisper, supports audio file transcription and near-realtime microphone input.β16Updated this week
- β91Updated 2 months ago
- Bookmarklet to pull and run hugging face GGUF models in Ollamaβ14Updated 5 months ago
- LLM Chat is an open-source serverless alternative to ChatGPT.β33Updated 7 months ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts withβ¦β205Updated last week
- β78Updated 3 weeks ago
- Your personal and private AIβ45Updated last week
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking aroundβ¦β54Updated 7 months ago
- An OpenAI API compatible speech to text server for audio transcription and translations, aka. Whisper.β74Updated 2 months ago
- Integrates AI tools into Microsoft Wordβ126Updated 3 months ago
- The hearth of The Pulsar App, fast, secure and shared inference with modern UIβ56Updated 4 months ago
- EmailGenius: AI-Driven Email Categorizationβ25Updated last year
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into diβ¦β202Updated 2 months ago
- A cutting-edge Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities.β57Updated this week
- Foundational Models for State-of-the-Art Speech and Text Translationβ11Updated last year
- Something similar to Apple Intelligence?β59Updated 9 months ago
- Easily create LLM automation/agent workflowsβ59Updated last year