meronym / speaker-transcription
Transcription with speaker diarization pipeline
☆85Updated last year
Related projects ⓘ
Alternatives and complementary repositories for speaker-transcription
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆83Updated 6 months ago
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote☆166Updated last month
- ☆152Updated last year
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆133Updated last year
- Fine tune SDXL on YouTube videos☆172Updated 2 months ago
- A curated list of awesome OpenAI's Whisper☆93Updated last year
- Generate visual podcasts about novels using open source models☆23Updated last year
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆196Updated last week
- A testing repo to share code and thoughts on diarisation☆51Updated 7 months ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆152Updated last month
- streaming speech to text server using Whisper☆83Updated last year
- Shush is an app that deploys a WhisperV3 model with Flash Attention v2 on Modal and makes requests to it via a NextJS app☆187Updated 5 months ago
- The code for some apps built with Sieve.☆70Updated 3 weeks ago
- ☆253Updated 5 months ago
- Cog wrapper for Coqui / xtts-v2☆67Updated 10 months ago
- XTTS: Multilingual Voice Cloning TTS Model by Coqui Deployed to Replicate☆24Updated last year
- Performant and accurate speech recognition built on Pytorch☆248Updated 2 years ago
- ImageBind One Embedding Space to Bind Them All☆17Updated last year
- Faster Tortoise inference then Tortoise Fast Fork☆122Updated 6 months ago
- Talk to GPT-4 and create a story together.☆84Updated 11 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆105Updated last year
- ☆84Updated last year
- Conduct consumer interviews with synthetic focus groups using LLMs and LangChain☆43Updated last year
- ☆42Updated last month
- OpenAI Whisper + davinci for podcast summarization☆70Updated last year
- Turn text from websites into spoken audio with edge-tts and save as mp3 files☆28Updated this week
- whisper.cpp bindings for python☆76Updated last year
- A huggingface pipeline to train a gpt model based on the transcript obtained byt the Open AI whisper model☆15Updated last year