poloniki / quintLinks
Transcription/Chunking/Summarization of audio content.
☆62Updated last year
Alternatives and similar repositories for quint
Users that are interested in quint are comparing it to the libraries listed below
Sorting:
- How to use OpenAIs Whisper to transcribe and diarize audio files☆346Updated 2 years ago
- Pybind11 bindings for Whisper.cpp☆334Updated 7 months ago
- Transcription with speaker diarization pipeline☆94Updated 2 years ago
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote☆217Updated 4 months ago
- A bash script using OpenAI Whisper API for continuous audio transcription with automatic silence detection☆110Updated last year
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆214Updated 8 months ago
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens☆503Updated last year
- 💭 Build autonomous agents, retrieval augmented generation (RAG) processes and language model powered chat applications☆293Updated 2 months ago
- Podalize: Podcast Transcription and Analysis☆156Updated 10 months ago
- Python bindings for whisper.cpp☆240Updated last year
- ☆100Updated 2 years ago
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️☆363Updated last year
- Local semantic search. Stupidly simple.☆431Updated last year
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine☆436Updated 10 months ago
- ☆92Updated 2 years ago
- Improving transcription performance of OpenAI Whisper for CPU based deployment☆246Updated 2 years ago
- SemanticFinder - frontend-only live semantic search with transformers.js☆283Updated 3 months ago
- Real time speech to text transcription app.☆419Updated 2 years ago
- ☆488Updated last year
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆220Updated 3 months ago
- Shush is an app that deploys a WhisperV3 model with Flash Attention v2 on Modal and makes requests to it via a NextJS app☆211Updated last year
- howdoi.ai☆255Updated 2 years ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆120Updated last year
- TheBloke's Dockerfiles☆305Updated last year
- faster-whisper as serverless endpoint☆108Updated last month
- A command-line interface to generate textual and conversational datasets with LLMs.☆301Updated last year
- Chat to Compose Video☆189Updated last year
- llama.cpp with BakLLaVA model describes what does it see☆382Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback.☆117Updated 2 years ago
- Build robust LLM applications with true composability 🔗☆419Updated last year