mave5 / podalize
Podalize: Podcast Transcription and Analysis
β154Updated 6 months ago
Alternatives and similar repositories for podalize:
Users that are interested in podalize are comparing it to the libraries listed below
- A curated list of awesome OpenAI's Whisperβ99Updated last year
- How to use OpenAIs Whisper to transcribe and diarize audio filesβ329Updated 2 years ago
- π¬ ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.β205Updated 4 months ago
- β480Updated last year
- web based editor for subtitles and transcriptsβ126Updated 7 months ago
- Transcription and Diarization based on OpenAI's Whisperβ21Updated last year
- ez audio transcription tool with flexible processing and post-processing optionsβ147Updated last year
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts withβ¦β199Updated last month
- Fast Audio/Video transcribe using Openai's Whisper and Modal, an hour audio/video file can be transcribed in ~1 minuteβ78Updated last year
- Open-source Rewind.ai clone written in Rust and Vue running 100% locally with whisper.cppβ50Updated last year
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokensβ480Updated last year
- Transcription with speaker diarization pipelineβ90Updated last year
- A lightweight transcript editor for editing and correcting STT generated timed transcriptsβ45Updated this week
- Dictation app based on the OpenAI speech-to-text modelsβ175Updated 8 months ago
- A quick experiment to achieve almost realtime transcription using Whisper.β187Updated 2 years ago
- Whisper2Summarize is an application that uses Whisper for audio processing and GPT for summarization. It generates summaries of audio traβ¦β51Updated last year
- Shush is an app that deploys a WhisperV3 model with Flash Attention v2 on Modal and makes requests to it via a NextJS appβ200Updated 9 months ago
- β91Updated last year
- Streaming transcriber with whisperβ688Updated last year
- β46Updated last year
- β571Updated 10 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelinesβ94Updated 10 months ago
- A langchain app to visualise a debate using Tree-of-Thought reasoningβ59Updated last year
- Transcription and diarization (speaker identification)β31Updated last year
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannoteβ195Updated last month
- β21Updated 3 years ago
- Hyperaudio Lite - a Super-lightweight Interactive Transcript Playerβ140Updated 4 months ago
- β2Updated last year
- Speech to text to speech using Elevenlabsβ28Updated last year
- Meeper π - is your secretary for any in-browser conference.β67Updated last year