mobiusml / faster-whisperLinks
Faster Whisper ASR transcription with CTranslate2
☆22Updated 8 months ago
Alternatives and similar repositories for faster-whisper
Users that are interested in faster-whisper are comparing it to the libraries listed below
Sorting:
- Speaker diarization service☆23Updated 3 weeks ago
- a simple system for 2-way interruptible voice interactions between human and LLM☆30Updated last year
- Open TTS models, built for streaming on the edge☆43Updated 4 months ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆65Updated 3 weeks ago
- Audio tokenization, in the fastest way possible!☆52Updated 10 months ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆15Updated 2 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆63Updated last month
- Python package of MP-SENet from Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement.☆13Updated 8 months ago
- proof of concept conversation orchestrator with a speech-language model☆20Updated 8 months ago
- Joint speech-language model - respond directly to audio!☆30Updated last year
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆90Updated last month
- Hanasu is a human-like TTS model based on the multilingual Himitsu V1 transformer-based encoder and VITS architecture☆32Updated last month
- python bindings for symphonia/opus - read various audio formats from python and write opus files☆64Updated 2 months ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆29Updated 2 years ago
- ☆16Updated 3 months ago
- IPA Phonemizer/Dephonemizer for 139 human languages☆30Updated last week
- Stable timestamps and confidence score for words of OpenAI's Whisper outputs down to word-level.☆25Updated 2 years ago
- Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX☆27Updated 9 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆96Updated last year
- ☆16Updated 4 months ago
- This repository includes training, inference, evaluation, and utility scripts developed for fine-tuning the Whisper medium.en model on Ai…☆14Updated 9 months ago
- A lightweight Python library for running TTS models with a unified API.☆20Updated 4 months ago
- Open-source and reproducible benchmarks for Speaker Diarization☆29Updated last week
- Whisper combined with Silero VAD, for improved long-form transcriptions☆52Updated 2 years ago
- Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification.☆20Updated 4 months ago
- Speaker Diarization with Transformers☆68Updated last month
- ⚡ Blazing fast audio augmentation in Python, powered by GPU for high-efficiency processing in machine learning and audio analysis tasks.☆33Updated last year
- Experiments with BitNet inference on CPU☆54Updated last year
- ☆51Updated 2 weeks ago
- C++ version of pyannote audio overlapped speech detection pipeline☆13Updated last year