gladiaio / gladia-cliLinks
☆19Updated last year
Alternatives and similar repositories for gladia-cli
Users that are interested in gladia-cli are comparing it to the libraries listed below
Sorting:
- A curated list of awesome OpenAI's Whisper☆102Updated 2 years ago
- web based editor for subtitles and transcripts☆143Updated last year
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆53Updated last year
- OpenAI Whisper + davinci for podcast summarization☆70Updated 2 years ago
- TTS Client for Coqui TTS server☆13Updated 3 years ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆100Updated last year
- On-device noise suppression powered by deep learning☆81Updated last week
- Real-Time Whisper Voice Recognition with vosk model feedback.☆121Updated 2 years ago
- ☆32Updated 2 months ago
- Open-source Rewind.ai clone written in Rust and Vue running 100% locally with whisper.cpp☆51Updated 2 years ago
- A lightweight transcript editor for editing and correcting STT generated timed transcripts☆54Updated 3 weeks ago
- Browser extension to help users find and manage scholarships.☆19Updated 11 months ago
- Transcribe audio and video files with speaker diarization and logically grouped timestamps using Gemini Flash☆51Updated 3 weeks ago
- A python library to find differences between audio and transcriptions☆19Updated 2 years ago
- Seamless Voice Interactions with LLMs☆12Updated 2 years ago
- Whatsapp Web Speech To Text☆54Updated 2 years ago
- An open-source Claude 3 prompt optimizer☆13Updated last year
- Accelerate Whisper tasks such as transcription, by multiprocesing through parallelization☆25Updated 3 years ago
- Utility functions for python data pipelines with generators.☆22Updated last year
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆26Updated 2 years ago
- Transcription with speaker diarization pipeline☆98Updated 2 years ago
- Modal LLM LLama.cpp based model deployment as part of series of Model as a Service (MaaS)☆16Updated last year
- Play.ht's Text to Speech API☆94Updated 5 months ago
- Open Server is an OpenAI API Compatible Server for generating text, images, embeddings, and storing them in vector databases. It also inc…☆17Updated 2 years ago
- Run OpenAI Whisper as a Cog model☆69Updated last year
- A curated list of awesome voice activity detection☆71Updated last year
- ez audio transcription tool with flexible processing and post-processing options☆162Updated last year
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text☆34Updated 5 years ago
- Joint speech-language model - respond directly to audio!☆30Updated last year
- ivrit.ai codebase☆44Updated 3 months ago