ElmiraGhorbani / gpt-speaker-diarizationLinks
Conversational Speaker Diarization using OpenAI AI Language Models(gpt-4) and OpenAI Whisper.
☆14Updated 2 years ago
Alternatives and similar repositories for gpt-speaker-diarization
Users that are interested in gpt-speaker-diarization are comparing it to the libraries listed below
Sorting:
- Retrieval Augmented Generation for youtube videos with a BRAD agent☆33Updated 10 months ago
- A proposed GPT chatbot for teachers that uses retrieval-augmentation to answer questions about their students.☆10Updated 11 months ago
- A python library to find differences between audio and transcriptions☆19Updated 2 years ago
- ProfitPilot closes deals for you effortlessly 24/7, just provide a list of customer and ProfitPilot will reach out on your behalf and clo…☆21Updated 2 years ago
- AI Lip Syncing application, deployed on Streamlit☆43Updated last year
- ASR + diarization model server with speculative decoding☆63Updated last year
- LipSync AI is your ultimate solution for flawless lip-syncing in videos. Our AI model precisely synchronizes audio and video, creating li…☆12Updated 2 years ago
- Talk to your database as if you were chatting with a friend. Turn natural language into powerful SQL queries effortlessly, and get your a…☆11Updated last year
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆140Updated last year
- Imagine translating your speech or anybody's speech to any language you want within minutes. check this out...☆50Updated last year
- Multimodal Open Source Framework for Conversational Agent Research and Development.☆21Updated 9 months ago
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.☆88Updated this week
- An audio processing tool for detecting and removing silence in audio recordings. Create text files for video silence removal using custom…☆25Updated 6 months ago
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Updated 2 months ago
- ☆22Updated last year
- 🤖 Quantum-powered excuse generator for developers. Blame bugs on cosmic rays, AI sentience, or Schrödinger’s intern.☆26Updated 3 months ago
- A Full-Duplex Open-Domain Dialogue Agent with Continuous Turn-Taking Behavior☆36Updated 2 years ago
- Engage in conversation with your virtual self using AI techniques like NLP, voice cloning, and computer vision. Get accurate answers with…☆84Updated 2 years ago
- VideoDB Python SDK☆84Updated last week
- This repository features a Gradio interface designed to leverage the OpenAI Text-To-Speech (TTS) API. The interface lets users create spe…☆14Updated 2 years ago
- SadTalker gradio_demo.py file with code section that allows you to set the eye blink and pose reference videos for the software to use wh…☆11Updated 2 years ago
- BanterBot: An OpenAI ChatGPT-powered chatbot with Azure Neural Voices. Supports multilingual speech-to-text and text-to-speech interactio…☆11Updated 5 months ago
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆16Updated last month
- ☆17Updated 2 years ago
- Open Sourced NoteBookLM☆59Updated last year
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆23Updated last year
- Simple playground chat app that interacts with OpenAI's functions with memory and custom tools.☆18Updated 2 years ago
- AI Talking Head: create video from plain text or audio file in minutes, support up to 100+ languages and 350+ voice models.☆36Updated 3 years ago
- Open Source Study Assistant☆44Updated last year
- Gradio_demo.py with Blinking on Still Mode Video Creation☆12Updated 2 years ago