runpod-workers / worker-faster_whisper
š§ | RunPod worker of the faster-whisper model for Serverless Endpoint.
ā76Updated last month
Alternatives and similar repositories for worker-faster_whisper:
Users that are interested in worker-faster_whisper are comparing it to the libraries listed below
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelinesā90Updated 8 months ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.ā37Updated last month
- On-device streaming text-to-speech engine powered by deep learningā62Updated this week
- ā154Updated last year
- Whisper realtime streaming for long speech-to-text transcription and translationā110Updated 11 months ago
- Video+code lecture on building nanoGPT from scratchā65Updated 7 months ago
- š | A simple worker that can be used as a starting point to build your own custom RunPod Endpoint API worker.ā88Updated 2 months ago
- Efficient approach to speaker diarization using voice characteristics extractionā81Updated 8 months ago
- Generate visual podcasts about novels using open source modelsā24Updated last year
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.ā46Updated last year
- A simple TTS server for generating speech using StyleTTS2ā32Updated last year
- Cog wrapper for collabora/WhisperSpeechā25Updated 10 months ago
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com.ā113Updated 7 months ago
- LLaVA server (llama.cpp).ā176Updated last year
- ā90Updated 8 months ago
- ā37Updated last year
- š š¤ Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloningā148Updated 6 months ago
- https://hf.co/hexgrad/Kokoro-82Mā41Updated this week
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.ā47Updated last month
- ASR + diarization model server with speculative decodingā53Updated 7 months ago
- š¬ ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.ā202Updated 2 months ago
- Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes sā¦ā52Updated 8 months ago
- ā24Updated last year
- ā107Updated 3 weeks ago
- Speaker Diarization with Transformersā61Updated 7 months ago
- An JS web client for connecting to Pipecat bots with voice and visionā42Updated 3 weeks ago
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async APIā44Updated 3 months ago
- Estimate Your LLM's Token Toll Across Various Platforms and Configurationsā30Updated 5 months ago
- VideoDB Python SDKā63Updated this week
- Gradio based tool to run opensource LLM models directly from Huggingfaceā90Updated 6 months ago