gladiaio / gladia-cliLinks
☆18Updated 9 months ago
Alternatives and similar repositories for gladia-cli
Users that are interested in gladia-cli are comparing it to the libraries listed below
Sorting:
- web based editor for subtitles and transcripts☆142Updated last year
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆54Updated 10 months ago
- A curated list of awesome OpenAI's Whisper☆98Updated 2 years ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆119Updated 2 years ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆96Updated last year
- OpenAI Whisper + davinci for podcast summarization☆71Updated 2 years ago
- Cog wrapper for collabora/WhisperSpeech☆24Updated last year
- Whatsapp Web Speech To Text☆54Updated 2 years ago
- This project presents a comprehensive study on video dubbing techniques and the development of a specialized video dubbing system.☆11Updated 2 years ago
- Open Server is an OpenAI API Compatible Server for generating text, images, embeddings, and storing them in vector databases. It also inc…☆17Updated last year
- Accelerate Whisper tasks such as transcription, by multiprocesing through parallelization☆25Updated 2 years ago
- On-device noise suppression powered by deep learning☆74Updated 2 months ago
- A python library to find differences between audio and transcriptions☆19Updated last year
- Seamless Voice Interactions with LLMs☆12Updated last year
- TTS-Wrapper makes it easier to use text-to-speech APIs by providing a unified and easy-to-use interface.☆30Updated last month
- Chat with an AI simulation of anyone as easily as copy-pasting text into a folder!☆19Updated 2 years ago
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python☆18Updated 2 years ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆65Updated last year
- Generate visual podcasts about novels using open source models☆25Updated 2 years ago
- Modal LLM LLama.cpp based model deployment as part of series of Model as a Service (MaaS)☆15Updated 9 months ago
- Open-source Rewind.ai clone written in Rust and Vue running 100% locally with whisper.cpp☆50Updated 2 years ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated 2 years ago
- A mono-repo to house the various supported Transport options to be used with Pipecat's client-js package☆29Updated this week
- A curated list of awesome voice activity detection☆66Updated 10 months ago
- Joint speech-language model - respond directly to audio!☆30Updated last year
- ez audio transcription tool with flexible processing and post-processing options☆159Updated last year
- YoutubeGPT is a web application powered by OpenAI's Whisper model for speech recognition and GPT-3 for text summarization. It extracts tr…☆17Updated 2 years ago
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text☆34Updated 5 years ago
- Transcribe audio and video files with speaker diarization and logically grouped timestamps using Gemini Flash☆38Updated 3 weeks ago