metame-ai / faster-distil-whisper
Faster distil-whisper transcription with CTranslate2
☆13Updated last year
Alternatives and similar repositories for faster-distil-whisper:
Users that are interested in faster-distil-whisper are comparing it to the libraries listed below
- Faster Whisper transcription with CTranslate2☆8Updated last year
- ☆255Updated last year
- Create an LJSpeech structured voice dataset on wave input☆28Updated 6 months ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆135Updated last year
- ☆88Updated 2 weeks ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆53Updated 4 months ago
- ☆95Updated 11 months ago
- Official implementation of the TTS model Lina-Speech☆163Updated 3 months ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆48Updated last week
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆95Updated 6 months ago
- Collection of Open Source Speech Data☆153Updated 5 months ago
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion☆174Updated 6 months ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆158Updated 9 months ago
- ☆216Updated last month
- Enhancing Translation with RAG-Powered Large Language Models☆77Updated last month
- Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis☆253Updated last month
- create dataset from list of youtube links easily☆17Updated 2 years ago
- This is an optimized implementation of OpenAI's Whisper for multilingual transcription.☆38Updated 2 years ago
- Running the F5-TTS by ONNX Runtime☆146Updated 2 weeks ago
- Pybind11 bindings for Whisper.cpp☆55Updated 3 weeks ago
- Finetune VITS and MMS using HuggingFace's tools☆145Updated last year
- NeMo text processing for ASR and TTS☆324Updated this week
- convert a saved pytorch model to gguf and generate as much corresponding ggml c code as possible☆14Updated last year
- G2P☆218Updated last week
- TorToiSe fine-tuning with DLAS☆220Updated 8 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆94Updated 11 months ago
- ☆356Updated 7 months ago
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper☆25Updated 8 months ago
- whisper.cpp bindings for python☆94Updated last year
- Tunable pipelines☆33Updated 2 months ago