metame-ai / faster-distil-whisper
Faster distil-whisper transcription with CTranslate2
☆14Updated last year
Alternatives and similar repositories for faster-distil-whisper
Users who are interested in faster-distil-whisper are comparing it to the libraries listed below.
- Faster Whisper transcription with CTranslate2☆8Updated last year
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper☆26Updated 9 months ago
- Enhancing Translation with RAG-Powered Large Language Models☆81Updated last month
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆159Updated 10 months ago
- ☆96Updated last year
- Turn any common eBook file into an HQ Audiobook with F5-TTS (Easy Install)☆23Updated last month
- Adds a web API to RVC to infer via json requests☆23Updated 10 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆148Updated 3 weeks ago
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆16Updated last week
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆52Updated 5 months ago
- ☆53Updated 3 months ago
- Efficient approach to speaker diarization using voice characteristics extraction☆94Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆95Updated 7 months ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆55Updated last month
- An unofficial PyTorch implementation of VALL-E☆87Updated last week
- Pybind11 bindings for Whisper.cpp☆57Updated 2 weeks ago
- Create text chunks which end at natural stopping points without using a tokenizer☆26Updated 2 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆63Updated last week
- Hanasu is a human-like TTS model based on the multilingual Himitsu V1 transformer-based encoder and VITS architecture☆28Updated last month
- Web-based editor for subtitles and transcripts☆130Updated 9 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆94Updated last year
- Cortex.Tensorrt-LLM is a C++ inference library that can be loaded by any server at runtime. It submodules NVIDIA’s TensorRT-LLM for GPU a…☆43Updated 7 months ago
- Official implementation of the TTS model Lina-Speech☆165Updated 4 months ago
- A performant high-throughput CPU-based API for Meta's No Language Left Behind (NLLB) using CTranslate2, hosted on Hugging Face Spaces.☆111Updated this week
- ☆256Updated last year
- Experimental sampler to make LLMs more creative☆31Updated last year
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.☆243Updated 11 months ago
- A collection of compiled wheels for deepspeed built for python 3.10 and 3.11 with support for cuda 11.8 and 12.1 for Windows☆65Updated 7 months ago
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation☆43Updated last year
- ☆124Updated 10 months ago