ElmiraGhorbani / gpt-speaker-diarization

Conversational speaker diarization using OpenAI language models (GPT-4) and OpenAI Whisper.

☆14

Alternatives and similar repositories for gpt-speaker-diarization

Users who are interested in gpt-speaker-diarization are comparing it to the libraries listed below.

justinjohn0306 / SpeedScribe
High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…
☆10 · Updated 8 months ago
ElmiraGhorbani / chatgpt-long-term-memory
The ChatGPT Long Term Memory package is a powerful tool designed to empower your projects with the ability to handle a large number of si…
☆57 · Updated last year
TengHu / Interactive-RAG
☆16 · Updated last year
JakeFurtaw / Chat-RAG
Advanced Coding AI Assistant that uses a Gradio interface to stream coding related responses. ChatRAG supports local and API inference an…
☆22 · Updated last month
shamspias / The-Rasa-Answer-Machine-GPT3
"The-Rasa-Answer-Machine-GPT3" is an advanced chatbot equipped to answer questions and offer useful info. Constructed with Rasa & GPT-3, …
☆25 · Updated 2 years ago
tarzain / crosstalk
a simple system for 2-way interruptible voice interactions between human and LLM
☆29 · Updated last year
mesolitica / multimodal-LLM
Multi-modal language modeling with image, audio, and text integration, including multiple images and multiple audio inputs in a single multi-turn conversation.
☆18 · Updated last year
clement-pages / gryannote
Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.
☆62 · Updated last month
revdotcom / reverb-self-hosted
This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.
☆52 · Updated 6 months ago
DAMO-NLP-SG / Multipurpose-Chatbot
A chatbot UI for RAG, multimodal, and text completion (supports Transformers, llama.cpp, MLX, vLLM).
☆19 · Updated last year
kyegomez / ProfitPilot
ProfitPilot closes deals for you effortlessly 24/7: just provide a list of customers and ProfitPilot will reach out on your behalf and clo…
☆22 · Updated last year
mallahyari / RealtimeSTT-TTS
A library for real-time Speech to Text (STT), and Text to Speech (TTS) capability
☆40 · Updated last year
kevon217 / custom-agency-swarms
playground for custom gpts built with agency-swarms (https://github.com/VRSEN/agency-swarm)
☆15 · Updated last year
camenduru / video-dubbing-colab
☆16 · Updated last year
LAION-AI / Desktop-BUD-E_V1.0
BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…
☆20 · Updated 8 months ago
assafelovic / awesome-langchain
😎 Awesome list of tools and projects with the awesome LangChain framework
☆17 · Updated last year
hanantabak2 / AI_Research_Assistant_CrewAI_RAG
Building AI Research Assistant: Multi-Agent RAG System Reading From Multiple Unstructured Sources
☆22 · Updated 11 months ago
NidumAI-Inc / agent-studio
Agent Studio is an AI agent application designed to handle real-time interactions through phone calls, web-based voice user interfaces (V…
☆37 · Updated 7 months ago
AI4WA / OpenOmniFramework
Multimodal Open Source Framework for Conversational Agent Research and Development.
☆19 · Updated 4 months ago
martintomov / gpt4v-video-voiceover
Video Voiceover with gpt-4o-mini
☆33 · Updated 9 months ago
The-Swarm-Corporation / OmniParse
Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale …
☆19 · Updated this week
arham-kk / openai-tts
This repository features a Gradio interface designed to leverage the OpenAI Text-To-Speech (TTS) API. The interface lets users create spe…
☆14 · Updated last year
laelhalawani / gguf_modeldb
A quick and optimized solution to manage llama-based GGUF quantized models, download GGUF files, retrieve message formatting, add more mo…
☆12 · Updated last year
leporejoseph / PraisonAi-Streamlit
☆45 · Updated 11 months ago
deepgram-starters / flask-live-transcription
Get started using Deepgram's Live Transcription with this Flask demo app
☆35 · Updated 2 weeks ago
HallowSiddharth / VoiceCraftAI
VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time.
☆62 · Updated 8 months ago
BennyKok / leaked-zoom
☆30 · Updated last year
Ganesh-tamang / LivePortrait_video
☆12 · Updated 11 months ago
gradio-app / sambanova-gradio
☆21 · Updated 7 months ago
nestordemeure / GPTranslate
Translate any text using GPT.
☆16 · Updated 2 years ago