Conversational Speaker Diarization using OpenAI AI Language Models(gpt-4) and OpenAI Whisper.
☆14Aug 13, 2023Updated 2 years ago
Alternatives and similar repositories for gpt-speaker-diarization
Users that are interested in gpt-speaker-diarization are comparing it to the libraries listed below
Sorting:
- Power BI Custom Visual - Process Control Chart☆12Jun 14, 2017Updated 8 years ago
- Virtual news production using Tacotron2 and Wav2Lip☆11Nov 14, 2023Updated 2 years ago
- BanterBot: An OpenAI ChatGPT-powered chatbot with Azure Neural Voices. Supports multilingual speech-to-text and text-to-speech interactio…☆11Jan 23, 2026Updated last month
- An open-source platform for building and deploying real-time, low-latency AI voice agents for call automation for marketing.☆18Oct 16, 2025Updated 4 months ago
- Eliza Agent Weaver enables you to develop a set of Character files based on your own lore, and connects the narratives of multiple agents…☆10Dec 12, 2024Updated last year
- Planning docs and PRDs for the Claude Code Agentic RAG Masterclass video series. Build a full-stack AI app with Python, React, and Supaba…☆54Updated this week
- automatic music transcription application written in java☆12Jan 13, 2013Updated 13 years ago
- Talk to your database as if you were chatting with a friend. Turn natural language into powerful SQL queries effortlessly, and get your a…☆10Nov 12, 2024Updated last year
- A proposed GPT chatbot for teachers that uses retrieval-augmentation to answer questions about their students.☆10Dec 7, 2024Updated last year
- Chatbot for NHS Medicines A-Z. Agentic Retrieval Augmented Generation utilising the OpenAI API, LangChain, and LangGraph to query a vecto…☆10Jun 24, 2024Updated last year
- This app uses OpenAI's LLM model to answer questions about your PDF file. Upload your PDF file and ask questions about it. The app will r…☆13May 13, 2025Updated 9 months ago
- VexFS is a Linux kernel-native file system with built-in vector search and semantic memory. Designed for AI agents, RAG, and LLM workload…☆25Oct 19, 2025Updated 4 months ago
- Simple LLM-enabled document Q&A app built using Langchain and Streamlit☆10Dec 4, 2024Updated last year
- real-time web visualizer for 3D gaussian splatting☆10Jan 31, 2025Updated last year
- A high-performance, distributed memory management system for LLM agents built with LangGraph, LangChain, Ray, and vLLM. Features multi-la…☆11Apr 23, 2025Updated 10 months ago
- Agent building tools via block diagram UI☆12Dec 31, 2025Updated 2 months ago
- Fork of RecurrentGPT with modifications☆10Sep 18, 2024Updated last year
- end-to-end automated video generation pipeline designed to create engaging, TikTok-style viral short videos using AI.☆20Jun 7, 2025Updated 9 months ago
- Official repository of Tapir Lab.'s Lip-Sync Method☆10Oct 3, 2023Updated 2 years ago
- ☆11May 2, 2022Updated 3 years ago
- Python speech recognition script utilizing the Dragonfly library for speech recognition on Windows☆13Feb 6, 2026Updated last month
- Code for TCSVT paper "Exploring Spatio-Temporal Graph Convolution for Video-based Human-Object Interaction Recognition"☆12Mar 30, 2023Updated 2 years ago
- A Cyberpunk 2077 First-Person Multi Rig for Blender (4.0+)☆11Jan 10, 2026Updated 2 months ago
- LightRAG with Neo4j Example Project☆17May 19, 2025Updated 9 months ago
- A non-slop skill creator for competent expert-level skills. Extract expertise through guided interviews or expert conversations, separate…☆23Dec 24, 2025Updated 2 months ago
- A project about Virtual Try-On. Lines of code ~5,200.☆10Jan 27, 2021Updated 5 years ago
- An OpenAI GPT-powered chatbot utilizing deforum stable diffusion to aid developers in efficiently searching through documentations.☆13Jun 24, 2023Updated 2 years ago
- Implementing an interactive AI avatar using Python, Blender and GPT☆11Dec 5, 2023Updated 2 years ago
- calvis: Chest, wAist and peLVIS circumference from 3D human Body meshes for Deep Learning.☆11May 15, 2025Updated 9 months ago
- ☆10Apr 22, 2021Updated 4 years ago
- ☆14Jun 16, 2023Updated 2 years ago
- Research on algorithms for garment perception, manipulation...☆12Sep 15, 2023Updated 2 years ago
- ☆12Sep 4, 2023Updated 2 years ago
- Easily create video datasets with auto-captioning for Hunyuan-Video LoRA finetuning☆14Apr 2, 2025Updated 11 months ago
- Implementation for WatchYourMouth: Silent Speech Recognition with Depth Sensing presented at CHI 2024☆16Oct 6, 2025Updated 5 months ago
- Rendering SMPL using neural-mesh-render!!☆12Aug 6, 2020Updated 5 years ago
- ☆15Oct 10, 2023Updated 2 years ago
- Human Pose Estimation in Real-World Metric Coordinates☆12Jul 6, 2023Updated 2 years ago
- Code and data for our paper "High-Fidelity 3D Digital Human Creation from RGB-D Selfies".☆20Dec 30, 2024Updated last year