Majdoddin / lexicaps
Transcription and Diarization based on OpenAI's Whisper
☆21 · Updated last year
Alternatives and similar repositories for lexicaps:
Users who are interested in lexicaps are comparing it to the libraries listed below:
- Web-based editor for subtitles and transcripts ☆126 · Updated 7 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆94 · Updated 10 months ago
- This package is the Python implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid… ☆19 · Updated 5 months ago
- This project demonstrates how to parse emails, process them using OpenAI's GPT-3.5, and load the data into a Weaviate vector database for… ☆20 · Updated last year
- This master's thesis project is based on OpenAI Whisper, with the goal of transcribing interviews ☆47 · Updated 7 months ago
- This project aims to combine the latest LLMs, Multi-Step Asynchronous Function Calling, Natural Language Processing, and Text-to-Speech. ☆37 · Updated 11 months ago
- Efficient approach to speaker diarization using voice characteristics extraction ☆92 · Updated 11 months ago
- TypeScript-based library for real-time audio transcription, integrating OpenAI's Whisper model for accurate speech-to-text conversion. ☆67 · Updated last year
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution. ☆53 · Updated 3 months ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks. ☆45 · Updated last month
- Automatic fine-tuning of models with synthetic data ☆75 · Updated last year
- Speaker prediction for captions on the Lex Fridman podcast ☆25 · Updated last year
- 🎧 | RunPod worker of the faster-whisper model for Serverless Endpoint. ☆91 · Updated last month
- A minimalistic, Streamlit-based automatic speech recognition web app powered by OpenAI's Whisper ☆38 · Updated 2 years ago
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization. ☆205 · Updated 4 months ago
- Easy audio transcription tool with flexible processing and post-processing options ☆147 · Updated last year
- Shush is an app that deploys a WhisperV3 model with Flash Attention v2 on Modal and makes requests to it via a NextJS app ☆200 · Updated 9 months ago
- Open-source Rewind.ai clone written in Rust and Vue running 100% locally with whisper.cpp ☆50 · Updated last year
- Generate visual podcasts about novels using open source models ☆25 · Updated 2 years ago
- OpenAI-Assistant API integration with Speech Recognition and Eleven Labs TTS. User can choose name, description, model of assistant and … ☆18 · Updated last year
- Simli WebRTC AI Agent demo ☆20 · Updated 3 months ago
- The official Cartesia client for Python. ☆67 · Updated last week
- A real-time offline transcriber with a GUI, based on OpenAI Whisper ☆15 · Updated last year
- LLM Siri with OpenAI, Perplexity, Ollama, Llama2, Mistral, Mixtral & Langchain ☆59 · Updated last year
- A curated list of awesome OpenAI's Whisper ☆99 · Updated last year
- Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and… ☆57 · Updated this week
- Langchain tools to search/extract/transcribe text transcripts of Youtube videos. Some of this has been integrated into LangChain main bra… ☆65 · Updated last year