poloniki / quintLinks
Transcription/Chunking/Summarization of audio content.
β62Updated 2 years ago
Alternatives and similar repositories for quint
Users that are interested in quint are comparing it to the libraries listed below
Sorting:
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokensβ527Updated 2 years ago
- π¬ ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.β217Updated last year
- How to use OpenAIs Whisper to transcribe and diarize audio filesβ367Updated 3 years ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelinesβ97Updated last year
- Pybind11 bindings for Whisper.cppβ340Updated 11 months ago
- β207Updated last year
- β91Updated 2 years ago
- faster-whisper as serverless endpointβ125Updated 6 months ago
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannoteβ232Updated 9 months ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts withβ¦β242Updated 3 months ago
- A bash script using OpenAI Whisper API for continuous audio transcription with automatic silence detectionβ115Updated last year
- Transcription with speaker diarization pipelineβ97Updated 2 years ago
- Improving transcription performance of OpenAI Whisper for CPU based deploymentβ256Updated 3 years ago
- β490Updated 2 months ago
- Podalize: Podcast Transcription and Analysisβ159Updated last year
- π Build autonomous agents, retrieval augmented generation (RAG) processes and language model powered chat applicationsβ302Updated 6 months ago
- Whisper realtime streaming for long speech-to-text transcription and translationβ121Updated last year
- β551Updated last year
- TTS with The Massively Multilingual Speech (MMS) projectβ230Updated last year
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engineβ484Updated last year
- Convenience scripts to finetune (chat-)LLaMa3 and other models for any languageβ315Updated last year
- The fastest Whisper optimization for automatic speech recognition as a command-line interface β‘οΈβ383Updated last year
- β223Updated 2 years ago
- SemanticFinder - frontend-only live semantic search with transformers.jsβ306Updated 7 months ago
- Joint speech-language model - respond directly to audio!β372Updated last year
- β38Updated 2 years ago
- β101Updated 2 years ago
- Chat to Compose Videoβ195Updated last year
- streaming speech to text server using Whisperβ96Updated 2 years ago
- Python bindings for whisper.cppβ247Updated last year