Run different pipelines of WhisperX - Transcription, Diarization, VAD, Alignment completely OFFLINE.
☆48Mar 30, 2025Updated last year
Alternatives and similar repositories for offline-whisperx
Users that are interested in offline-whisperx are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni…☆23Aug 16, 2021Updated 4 years ago
- A enterprise-grade Chinese-English code switch punctuator from funasr.☆33Apr 26, 2024Updated 2 years ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆11Sep 30, 2024Updated last year
- Utilities for transcribing a set of audio files with IBM Watson Speech to Text (STT), then analyzing the error rate of the STT transcript…☆26Feb 5, 2026Updated 3 months ago
- A Python package to create XForms for ODK Collect.☆14Dec 27, 2019Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Media player developed under the Europeana Media Generic Services Project☆13Sep 7, 2023Updated 2 years ago
- Fulcrum Core☆13May 8, 2026Updated 2 weeks ago
- ☆21Jul 11, 2024Updated last year
- Transcription and diarization (speaker identification)☆33May 31, 2023Updated 2 years ago
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- An example of serialport working with electron☆10Jan 20, 2021Updated 5 years ago
- Easily share your custom workflows for anyone to run☆22Oct 17, 2024Updated last year
- ☆12Jan 29, 2026Updated 3 months ago
- 🦜Have a conversation with anyone from any youtube video.☆24Apr 26, 2023Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Arduino automatic 2 stereo channel audio input switch.☆13Jun 22, 2020Updated 5 years ago
- Workflows for generating AV editions and exhibits using IIIF manifests by HiPSTAS and Brumfield Labs.☆17Nov 17, 2024Updated last year
- Compute WER and SER for speech recognition evaluation☆26Mar 18, 2026Updated 2 months ago
- A simple extension that allows LLM to speak in any voice, literally, based on Sliero TTS which is available in oobabooga's textgen-webui …☆12Aug 26, 2023Updated 2 years ago
- A series of CSL files that output Markdown-formatted references for common citation styles.☆18Jun 22, 2021Updated 4 years ago
- Easily download videos from Azure Media Services with Python on any platform. ⚡️☆20Apr 27, 2026Updated 3 weeks ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆17May 16, 2025Updated last year
- Hpyformer base FunASR☆30Nov 5, 2024Updated last year
- a open source iOS framework☆14Jun 17, 2015Updated 10 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and …☆440May 12, 2026Updated last week
- Auto generated swig python module with a binary compnent☆11Apr 19, 2012Updated 14 years ago
- ☆14Aug 9, 2021Updated 4 years ago
- 基于wenet的短时在线语音识别服务☆11Feb 25, 2023Updated 3 years ago
- <综合> Funasr语音识别,调用Qwen大模型回答,通过GPTSovits输出语音的ai程序,其中调用模型还是在线,后续将添加离线大模型☆13Nov 30, 2024Updated last year
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆18Aug 1, 2025Updated 9 months ago
- Reading engine - Render epubs in the browser or mobile☆21Updated this week
- Merge and clean up multi-line and multi-language subtitle files. Updated with language-based subtitle split. 将带有多行英文的SRT字幕合并成单行,同时合并中文翻译。…☆14Mar 30, 2015Updated 11 years ago
- 基于 Sherpa-ONNX 实现在线下载模型的端侧实时语音识别应用(Implement speech recognition based on Sherpa-ONNX by downloading the model online.)☆29Feb 27, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆16Nov 9, 2023Updated 2 years ago
- ☆15Oct 19, 2024Updated last year
- ☆20Feb 14, 2025Updated last year
- Custom Stepper View☆17Oct 30, 2014Updated 11 years ago
- Simple voice activity detection (VAD) algorithm in Python☆15Aug 10, 2023Updated 2 years ago
- Web app designed to enhance your interaction with OpenAI's language models☆12Jun 14, 2023Updated 2 years ago
- ☆14Nov 28, 2022Updated 3 years ago