ElmiraGhorbani / gpt-speaker-diarizationLinks
Conversational Speaker Diarization using OpenAI AI Language Models(gpt-4) and OpenAI Whisper.
☆14Updated 2 years ago
Alternatives and similar repositories for gpt-speaker-diarization
Users that are interested in gpt-speaker-diarization are comparing it to the libraries listed below
Sorting:
- Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale …☆19Updated 2 months ago
- 🤖 Quantum-powered excuse generator for developers. Blame bugs on cosmic rays, AI sentience, or Schrödinger’s intern.☆26Updated 4 months ago
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Updated 3 months ago
- ☆17Updated last year
- Talk to your database as if you were chatting with a friend. Turn natural language into powerful SQL queries effortlessly, and get your a…☆11Updated last year
- A python library to find differences between audio and transcriptions☆19Updated 2 years ago
- This repository features a Gradio interface designed to leverage the OpenAI Text-To-Speech (TTS) API. The interface lets users create spe…☆14Updated 2 years ago
- Using a single image and just 10 seconds of sample audio, our project enables you to create a video where it appears as if you're speakin…☆40Updated 2 years ago
- A chatbot UI for RAG, multimodal, text completion. (support Transformers, llama.cpp, MLX, vLLM)☆19Updated last year
- ☆19Updated last year
- ☆17Updated last year
- Cog wrapper for collabora/WhisperSpeech☆25Updated last year
- a simple system for 2-way interruptible voice interactions between human and LLM☆30Updated last year
- ☆12Updated last year
- 😎 Awesome list of tools and projects with the awesome LangChain framework☆19Updated 2 years ago
- Speech to text to speech using Elevenlabs☆28Updated 2 years ago
- Get started using Deepgram's Live Transcription with this Flask demo app☆41Updated this week
- AI Lip Syncing application, deployed on Streamlit☆43Updated last year
- Automatically generate a lip-synced avatar based off of a transcript and audio☆14Updated 2 years ago
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆17Updated 2 months ago
- Auto-Video maker handling many AI's☆11Updated last year
- Retrieval Augmented Generation for youtube videos with a BRAD agent☆33Updated 11 months ago
- Open Server is an OpenAI API Compatible Server for generating text, images, embeddings, and storing them in vector databases. It also inc…☆17Updated 2 years ago
- ☆40Updated last year
- ☆18Updated 4 months ago
- LLM Agent that performs sentiment analysis of drawings and natural language using a combination of Google Gemini Vision model and GPT-4 T…☆13Updated 2 years ago
- ☆16Updated 2 years ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆100Updated last year
- A proposed GPT chatbot for teachers that uses retrieval-augmentation to answer questions about their students.☆10Updated last year
- VideoDB Python SDK☆86Updated this week