ElmiraGhorbani / gpt-speaker-diarizationLinks
Conversational Speaker Diarization using OpenAI AI Language Models(gpt-4) and OpenAI Whisper.
☆14Updated 2 years ago
Alternatives and similar repositories for gpt-speaker-diarization
Users that are interested in gpt-speaker-diarization are comparing it to the libraries listed below
Sorting:
- 🤖 Quantum-powered excuse generator for developers. Blame bugs on cosmic rays, AI sentience, or Schrödinger’s intern.☆26Updated 3 months ago
- Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale …☆20Updated 2 months ago
- A python library to find differences between audio and transcriptions☆19Updated 2 years ago
- Multimodal Open Source Framework for Conversational Agent Research and Development.☆21Updated 10 months ago
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Updated 3 months ago
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆17Updated 2 months ago
- Retrieval Augmented Generation for youtube videos with a BRAD agent☆33Updated 10 months ago
- LipSync AI is your ultimate solution for flawless lip-syncing in videos. Our AI model precisely synchronizes audio and video, creating li…☆12Updated 2 years ago
- Engage in conversation with your virtual self using AI techniques like NLP, voice cloning, and computer vision. Get accurate answers with…☆84Updated 2 years ago
- Imagine translating your speech or anybody's speech to any language you want within minutes. check this out...☆49Updated last year
- ☆12Updated last year
- VideoDB Python SDK☆84Updated this week
- A proposed GPT chatbot for teachers that uses retrieval-augmentation to answer questions about their students.☆10Updated last year
- ☆19Updated last year
- ☆17Updated last year
- ☆16Updated 2 years ago
- Advanced Coding AI Assistant that uses a Gradio interface to stream coding related responses. ChatRAG supports local and API inference an…☆23Updated 7 months ago
- A Full-Duplex Open-Domain Dialogue Agent with Continuous Turn-Taking Behavior☆36Updated 2 years ago
- AI Lip Syncing application, deployed on Streamlit☆43Updated last year
- playground for custom gpts built with agency-swarms (https://github.com/VRSEN/agency-swarm)☆14Updated last year
- ProfitPilot closes deals for you effortlessly 24/7, just provide a list of customer and ProfitPilot will reach out on your behalf and clo…☆21Updated 2 years ago
- Small demos demonstrating different capabilities of LiveKit Agents☆22Updated 8 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆99Updated last year
- A chrome extention for quering a local llm model using llama-cpp-python, includes a pip package for running the server, 'pip install loca…☆18Updated 2 years ago
- ☆57Updated last week
- ☆15Updated 2 years ago
- Seamless Voice Interactions with LLMs☆12Updated 2 years ago
- An open-source translation agent designed to enhance the quality of text translations by leveraging large language models☆17Updated 4 months ago
- Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.☆15Updated last year
- This repository features a Gradio interface designed to leverage the OpenAI Text-To-Speech (TTS) API. The interface lets users create spe…☆14Updated 2 years ago