Majdoddin / lexicaps
Transcription and Diarization based on OpenAI's Whisper
☆19 Updated last year
Related projects:
- web based editor for subtitles and transcripts ☆102 Updated last month
- This package is the Python implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid… ☆15 Updated 7 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆81 Updated 4 months ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks. ☆29 Updated last month
- This project aims to combine the latest LLMs, Multi-Step Asynchronous Function Calling, Natural Language Processing, and Text-to-Speech. ☆37 Updated 5 months ago
- ☆41 Updated last year
- A langchain app to visualise a debate using Tree-of-Thought reasoning ☆51 Updated 6 months ago
- Port of Suno's Bark TTS transformer in Apple's MLX Framework ☆62 Updated 7 months ago
- An intelligent AI assistant that can do anything! ☆49 Updated 4 months ago
- A curated list of awesome OpenAI's Whisper ☆91 Updated last year
- ez audio transcription tool with flexible processing and post-processing options ☆122 Updated 7 months ago
- Audio to summary with openAI Whisper & GPT 3.5/4 using streamlit ☆62 Updated last year
- Wingman is the fastest and easiest way to run Llama models on your PC or Mac. ☆40 Updated 3 months ago
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper ☆36 Updated last year
- auto fine tune of models with synthetic data ☆71 Updated 7 months ago
- Autonomous AI Agent designed to engage in debate. Supports Ollama (for local LLMs) and Perplexity API. ☆40 Updated 4 months ago
- Self-hosted AI voice agent ☆50 Updated 3 weeks ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with… ☆134 Updated 3 weeks ago
- CopperAI offers a hands-free, voice-to-voice interaction system with a Large Language Model (LLM) ☆28 Updated 9 months ago
- this master thesis project is based on OpenAI Whisper with the goal to transcribe interviews ☆41 Updated last month
- Multimodal Chat with Gemini API