ElmiraGhorbani / gpt-speaker-diarization
Conversational Speaker Diarization using OpenAI Language Models (gpt-4) and OpenAI Whisper.
☆14Updated last year
Alternatives and similar repositories for gpt-speaker-diarization
Users who are interested in gpt-speaker-diarization are comparing it to the libraries listed below.
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Updated 8 months ago
- The ChatGPT Long Term Memory package is a powerful tool designed to empower your projects with the ability to handle a large number of si…☆57Updated last year
- ☆16Updated last year
- Advanced Coding AI Assistant that uses a Gradio interface to stream coding related responses. ChatRAG supports local and API inference an…☆22Updated last month
- "The-Rasa-Answer-Machine-GPT3" is an advanced chatbot equipped to answer questions and offer useful info. Constructed with Rasa & GPT-3, …☆25Updated 2 years ago
- A simple system for 2-way interruptible voice interactions between a human and an LLM☆29Updated last year
- Multi-Modal Language Modeling with Image, Audio and Text Integration, including multi-image and multi-audio input in a single multi-turn conversation.☆18Updated last year
- Provides Gradio custom components to make the diarization-based audio labeling process easier and faster.☆62Updated last month
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆52Updated 6 months ago
- A chatbot UI for RAG, multimodal, and text completion. (Supports Transformers, llama.cpp, MLX, vLLM)☆19Updated last year
- ProfitPilot closes deals for you effortlessly 24/7: just provide a list of customers and ProfitPilot will reach out on your behalf and clo…☆22Updated last year
- A library for real-time Speech to Text (STT), and Text to Speech (TTS) capability☆40Updated last year
- Playground for custom GPTs built with agency-swarm (https://github.com/VRSEN/agency-swarm)☆15Updated last year
- ☆16Updated last year
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆20Updated 8 months ago
- 😎 Awesome list of tools and projects with the awesome LangChain framework☆17Updated last year
- Building AI Research Assistant: Multi-Agent RAG System Reading From Multiple Unstructured Sources☆22Updated 11 months ago
- Agent Studio is an AI agent application designed to handle real-time interactions through phone calls, web-based voice user interfaces (V…☆37Updated 7 months ago
- Multimodal Open Source Framework for Conversational Agent Research and Development.☆19Updated 4 months ago
- Video Voiceover with gpt-4o-mini☆33Updated 9 months ago
- Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale …☆19Updated this week
- This repository features a Gradio interface designed to leverage the OpenAI Text-To-Speech (TTS) API. The interface lets users create spe…☆14Updated last year
- A quick and optimized solution to manage Llama-based GGUF quantized models, download GGUF files, retrieve message formatting, add more mo…☆12Updated last year
- ☆45Updated 11 months ago
- Get started using Deepgram's Live Transcription with this Flask demo app☆35Updated 2 weeks ago
- VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time.☆62Updated 8 months ago
- ☆30Updated last year
- ☆12Updated 11 months ago
- ☆21Updated 7 months ago
- Translate any text using GPT.☆16Updated 2 years ago