justinwlin / runpodWhisperx
Runpod WhisperX Docker Container Repo
☆13 Updated 11 months ago
Alternatives and similar repositories for runpodWhisperx:
Users who are interested in runpodWhisperx compare it to the repositories listed below
- RunPod worker of the faster-whisper model for Serverless Endpoint. ☆84 Updated 2 weeks ago
- Efficient approach to speaker diarization using voice characteristics extraction ☆88 Updated 9 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆92 Updated 9 months ago
- Transcription and diarization (speaker identification) ☆31 Updated last year
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models ☆66 Updated 2 years ago
- FastAPI service on top of WhisperX ☆68 Updated 3 weeks ago
- Real-time Voice Activity Detection (VAD) with example use cases such as a simple voice bot and live (real-time) transcription ☆65 Updated 8 months ago
- Transcription with speaker diarization pipeline ☆90 Updated last year
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote ☆192 Updated this week
- ☆36 Updated last year
- Convert your PDFs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and efficient… ☆47 Updated last week
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆57 Updated last week
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper. ☆135 Updated last year
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code ☆145 Updated 9 months ago
- Easily create a dataset from a list of YouTube links ☆17 Updated last year
- Create an LJSpeech structured voice dataset on wave input ☆26 Updated 4 months ago
- TorToiSe fine-tuning with DLAS ☆218 Updated 6 months ago
- ☆44 Updated this week
- Code showing how to transcribe audio to text using fairseq_meta_mms (Google Colab version) ☆18 Updated last year
- Tools for making LJSpeech datasets ☆24 Updated last year
- Imagine translating your speech or anybody's speech to any language you want within minutes. check this out... ☆40 Updated 5 months ago
- A curated list of awesome OpenAI's Whisper ☆99 Updated last year
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities. ☆48 Updated 6 months ago
- ☆80 Updated 7 months ago
- ☆350 Updated 11 months ago
- ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization. ☆205 Updated 3 months ago
- The code for some apps built with Sieve. ☆74 Updated 2 months ago
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio. ☆122 Updated 8 months ago
- Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning ☆150 Updated 7 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation ☆111 Updated last year