Faster Whisper ASR transcription with CTranslate2
☆24 Oct 25, 2024 Updated last year
Alternatives and similar repositories for faster-whisper
Users that are interested in faster-whisper are comparing it to the libraries listed below
- Arabic Grapheme-to-Phoneme (G2P) Conversion ☆13 Mar 15, 2025 Updated 11 months ago
- Transfer learning approach to pronunciation scoring ☆12 Jan 17, 2024 Updated 2 years ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition. ☆15 May 16, 2025 Updated 9 months ago
- A neural vocoder supporting Ring Attention, Conformer and NSF. ☆24 Aug 1, 2025 Updated 7 months ago
- A long-context eval ☆69 Updated this week
- Open TTS models, built for streaming on the edge ☆45 Mar 16, 2025 Updated 11 months ago
- [ICASSP 2026] Task Vector in TTS: Toward Emotionally Expressive Dialectal Speech Synthesis ☆36 Dec 24, 2025 Updated 2 months ago
- Uses the powerful WhisperS2T and CTranslate2 libraries to batch transcribe multiple files ☆70 Feb 13, 2026 Updated 2 weeks ago
- This repository contains all the code necessary for running the multilingual distilwhisper from the Ferraz et al. 2024 IEEE ICASSP paper. ☆33 Oct 23, 2025 Updated 4 months ago
- Implementation of Sesame's Conversational Speech Model for Hugging Face Transformers ☆57 May 17, 2025 Updated 9 months ago
- ☆54 Jul 16, 2025 Updated 7 months ago
- ☆31 Oct 29, 2024 Updated last year
- [ICLR 2025] 🚀 CodeMMLU Evaluator: A framework for evaluating LM models on CodeMMLU MCQs benchmark. ☆29 Apr 21, 2025 Updated 10 months ago
- This repo is related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH 2024 ☆36 Feb 5, 2026 Updated last month
- Turn any common eBook file into an HQ Audiobook with F5-TTS (Easy Install) ☆37 Dec 31, 2025 Updated 2 months ago
- Study and research with your docs, media, and AI in one place ☆33 Updated this week
- [ICASSP'25] Developing a Multilingual Dataset and Evaluation Metrics for Code-Switching: A Focus on Hong Kong's Polylingual Dynamics ☆36 Aug 10, 2025 Updated 6 months ago
- Add Arabic diacritics (tashkeel/harakat) using Rust/Python/C++/WASM and NLP models ☆48 Oct 4, 2025 Updated 5 months ago
- ☆13 Nov 5, 2024 Updated last year
- Realtime voice agents for role play and more. ☆41 Mar 7, 2025 Updated 11 months ago
- A tool to collect/validate audio recordings from workers on Amazon Mechanical Turk. Written in Python/Flask. (originally hosted on github… ☆14 Dec 19, 2022 Updated 3 years ago
- ☆13 Oct 9, 2025 Updated 4 months ago
- A Library for Scaling Mixed-Integer Optimization-Based Machine Learning. ☆12 Jun 24, 2024 Updated last year
- A simple PE parser that lists all import and export entries ☆12 Oct 11, 2018 Updated 7 years ago
- Cordova plugin for the Jitsi Meet React Native SDK ☆10 Jun 7, 2019 Updated 6 years ago
- Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" … ☆10 Nov 6, 2024 Updated last year
- ☆19 Jan 15, 2026 Updated last month
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories ☆19 Apr 10, 2025 Updated 10 months ago
- KittenTTS is an ultra-lightweight, CPU-friendly text-to-speech model with 15M params for real-time, high-quality voices. Open source, fas… ☆23 Updated this week
- Learning an Interpretable End-to-End Network for Real-Time Acoustic Beamforming ☆15 Aug 20, 2024 Updated last year
- Russian phonetic transcription ☆11 Nov 19, 2025 Updated 3 months ago
- ☆21 Jul 10, 2025 Updated 7 months ago
- A Python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using free Vosk Speech Recognition API) and TRANSLATED SUBTITLE FILE… ☆11 May 5, 2024 Updated last year
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR ☆11 Oct 21, 2022 Updated 3 years ago
- eCMU: An Efficient Phase-aware Framework for Music Source Separation with Conformer (IEEE RIVF23) ☆10 Oct 30, 2024 Updated last year
- Whisper finetuning ☆16 Apr 9, 2025 Updated 10 months ago
- Code for the paper "RIR-in-a-Box: Estimating Room Acoustics from 3D Mesh Data through Shoebox Approximation" presented at Interspeech 20… ☆15 Sep 1, 2024 Updated last year
- ☆11 Aug 11, 2023 Updated 2 years ago
- A Redis-compatible in-memory database server written in Rust with MLua-based Lua 5.1 scripting ☆17 Nov 28, 2025 Updated 3 months ago