Faster Whisper ASR transcription with CTranslate2
☆25Oct 25, 2024Updated last year
Alternatives and similar repositories for faster-whisper
Users that are interested in faster-whisper are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICASSP 2026] Task Vector in TTS: Toward Emotionally Expressive Dialectal Speech Synthesis☆39Dec 24, 2025Updated 5 months ago
- ☆31Oct 29, 2024Updated last year
- Transfer learning approach to pronunciation scoring☆12Jan 17, 2024Updated 2 years ago
- ComfyUI node for modular, human‑like Kani TTS. Generate natural, high‑quality speech from text☆38Oct 17, 2025Updated 7 months ago
- Arabic Grapheme-to-Phoneme (G2P) Conversion☆15Mar 15, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Turn any common eBook file into an HQ Audiobook with F5-TTS (Easy Install)☆39Apr 6, 2026Updated 2 months ago
- a Neural Vocoder supporting Ring Attention, Conformer and NSF.☆25Aug 1, 2025Updated 10 months ago
- ☆23Jul 10, 2025Updated 11 months ago
- Aana SDK is a powerful framework for building AI enabled multimodal applications.☆59Aug 22, 2025Updated 9 months ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆17May 16, 2025Updated last year
- ☆28Feb 11, 2026Updated 4 months ago
- Voice conversion with just linear regression.☆37Sep 25, 2025Updated 8 months ago
- Open TTS models, built for streaming on the edge☆45Mar 16, 2025Updated last year
- Uses the powerful WhisperS2T and Ctranslate2 libraries to batch transcribe multiple files☆76Apr 27, 2026Updated last month
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- [ICASSP‘25] Developing a Multilingual Dataset and Evaluation Metrics for Code-Switching: A Focus on Hong Kong's Polylingual Dynamics☆38Aug 10, 2025Updated 10 months ago
- A long-context eval☆129Jun 5, 2026Updated last week
- Script to demonstrate how to use a Language Model for Semantic Turn Detection. Refer to blog post for full details.☆18May 9, 2025Updated last year
- AIRS-Bench: an AI Research Science benchmark for quantifying the end-to-end AI research abilities of LLM agents☆95May 5, 2026Updated last month
- Inference code for Audiodec-Valle-Wenetspeech4TTS☆51Jul 14, 2024Updated last year
- ☆10Jan 8, 2025Updated last year
- ☆55Jul 16, 2025Updated 10 months ago
- Example repo showcasing model training and deployment with distil claude cli skill☆54Jan 19, 2026Updated 4 months ago
- Find Niquests at https://github.com/jawah/niquests HTTP/2 HTTP/3 QUIC Async☆12Oct 22, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [ICLR 2026] SR-Scientist: Scientific Equation Discovery With Agentic AI☆48Jan 27, 2026Updated 4 months ago
- Minimal example of MCP for parsing llms.txt☆39Apr 8, 2025Updated last year
- Offical implementation of "Life-Harness"☆173Jun 2, 2026Updated last week
- Implementation of Sesame's Conversational Speech Model for Hugging Face Transformers☆58May 17, 2025Updated last year
- A Medical / Clinical Note Taking Demo Application using Deepgram Voice Agent API☆16Jul 9, 2025Updated 11 months ago
- A collection of GGUF and quantizations for jina-embeddings-v4☆38Sep 18, 2025Updated 8 months ago
- Application of Retrieval-Augmented Reasoning on a domain-specific body of knowledge☆35Feb 27, 2026Updated 3 months ago
- Manipulate audio with a simple and easy high level interface☆20Jan 10, 2026Updated 5 months ago
- OpenBrowser is an open-source, AI-native browser built on Chromium — a truly privacy-first alternative to ChatGPT Atlas, Perplexity Comet…☆55Feb 24, 2026Updated 3 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆19Jun 9, 2023Updated 3 years ago
- python client library☆10Feb 15, 2017Updated 9 years ago
- Prompt Brewery☆53Aug 8, 2025Updated 10 months ago
- A Redis-compatible in-memory database server written in Rust with MLua-based Lua 5.1 scripting☆18Nov 28, 2025Updated 6 months ago
- This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024☆41Feb 5, 2026Updated 4 months ago
- Official repo for CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations☆66Jan 16, 2025Updated last year
- Crockford Base32 encoding for PostgreSQL unsigned integers☆13Sep 8, 2019Updated 6 years ago