Run different pipelines of WhisperX - Transcription, Diarization, VAD, Alignment completely OFFLINE.
☆47Mar 30, 2025Updated last year
Alternatives and similar repositories for offline-whisperx
Users that are interested in offline-whisperx are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is a project focused on Faster Whisper, a streaming speech recognition project.☆19Sep 27, 2024Updated last year
- Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni…☆23Aug 16, 2021Updated 4 years ago
- Utilities for transcribing a set of audio files with IBM Watson Speech to Text (STT), then analyzing the error rate of the STT transcript…☆26Feb 5, 2026Updated 2 months ago
- Transcription and diarization (speaker identification)☆34May 31, 2023Updated 2 years ago
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆13Jul 23, 2024Updated last year
- Local LLM set-up☆18Jul 1, 2024Updated last year
- 🦜Have a conversation with anyone from any youtube video.☆24Apr 26, 2023Updated 2 years ago
- Compute WER and SER for speech recognition evaluation☆27Mar 18, 2026Updated 3 weeks ago
- FreeSWITCH ASR module fork from mod_audio_stream, use FunASR online cpu version☆17Jun 27, 2025Updated 9 months ago
- A simple extension that allows LLM to speak in any voice, literally, based on Sliero TTS which is available in oobabooga's textgen-webui …☆12Aug 26, 2023Updated 2 years ago
- a open source iOS framework☆14Jun 17, 2015Updated 10 years ago
- CosyVoice语音合成简易API☆14Nov 1, 2024Updated last year
- funasr语音转文字的简单api版本,funasr+fastapi,方便部署在服务器上☆13Aug 10, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆11Dec 24, 2024Updated last year
- Auto generated swig python module with a binary compnent☆11Apr 19, 2012Updated 13 years ago
- Summarize (and translate) text using ChatGPT or a local LLM, with support for multiple large text files, PDF files. Preserves original st…☆18Feb 14, 2026Updated last month
- ☆12Jul 11, 2024Updated last year
- ☆14Aug 9, 2021Updated 4 years ago
- 基于wenet的短时在线语音识别服务☆11Feb 25, 2023Updated 3 years ago
- <综合> Funasr语音识别,调用Qwen大模型回答,通过GPTSovits输出语音的ai程序,其中调用模型还是在线,后续将添加离线大模型☆13Nov 30, 2024Updated last year
- ☆11Aug 26, 2024Updated last year
- ASR_LLM_TTS前端项目☆15Dec 3, 2024Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆18Aug 1, 2025Updated 8 months ago
- Merge and clean up multi-line and multi-language subtitle files. Updated with language-based subtitle split. 将带有多行英文的SRT字幕合并成单行,同时合并中 文翻译。…☆14Mar 30, 2015Updated 11 years ago
- FunASR安卓端侧离线版本2pass全模式☆15Sep 4, 2023Updated 2 years ago
- A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.☆12May 7, 2019Updated 6 years ago
- A collections of tools around sleep research: plotting of hypnograms / spectrograms, etc etc☆10Jan 24, 2026Updated 2 months ago
- ☆15Oct 19, 2024Updated last year
- ☆11Sep 2, 2023Updated 2 years ago
- PyQt(+PySide) Stable Diffusion GUI☆15Aug 1, 2023Updated 2 years ago
- Slidable Panel in Swift☆11Jul 18, 2020Updated 5 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆16May 1, 2021Updated 4 years ago
- Fork of RecurrentGPT with modifications☆10Sep 18, 2024Updated last year
- AI News Anchor Generator App built using Midjourney, D-ID, OpenAI, NewsAPI, and Streamlit.☆17Sep 18, 2023Updated 2 years ago
- LightRAG with Neo4j Example Project☆17May 19, 2025Updated 10 months ago
- Copilot with deepseek and more...☆13Mar 7, 2025Updated last year
- This app uses OpenAI's LLM model to answer questions about your PDF file. Upload your PDF file and ask questions about it. The app will r…☆14May 13, 2025Updated 10 months ago
- Code examples for the book "Deep Learning for Audio: A Comprehensive Journey From Theory to Deployment"☆19Apr 10, 2020Updated 6 years ago