ElmiraGhorbani / gpt-speaker-diarizationLinks
Conversational Speaker Diarization using OpenAI AI Language Models(gpt-4) and OpenAI Whisper.
☆14Updated 2 years ago
Alternatives and similar repositories for gpt-speaker-diarization
Users that are interested in gpt-speaker-diarization are comparing it to the libraries listed below
Sorting:
- 🤖 Quantum-powered excuse generator for developers. Blame bugs on cosmic rays, AI sentience, or Schrödinger’s intern.☆28Updated 5 months ago
- Retrieval Augmented Generation for youtube videos with a BRAD agent☆33Updated last year
- A Full-Duplex Open-Domain Dialogue Agent with Continuous Turn-Taking Behavior☆36Updated 2 years ago
- Simple playground chat app that interacts with OpenAI's functions with memory and custom tools.☆18Updated 2 years ago
- Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale …☆20Updated 3 months ago
- AI Lip Syncing application, deployed on Streamlit☆43Updated last year
- Multimodal Open Source Framework for Conversational Agent Research and Development.☆22Updated 11 months ago
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Updated 4 months ago
- A chatbot UI for RAG, multimodal, text completion. (support Transformers, llama.cpp, MLX, vLLM)☆20Updated last year
- Used GPT for Realtime AI (Artificial intelligence) tutor to help students, learn by talking screenshots of there work.☆13Updated last year
- ☆12Updated last year
- Open Source Study Assistant☆44Updated last year
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.☆87Updated last month
- The UnisonAI Multi-Agent Framework built on custom workflow which allows ai agents to talk together and provides a flexible and extensibl…☆24Updated last month
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆141Updated last year
- ezlocalai is an easy to set up local artificial intelligence server with OpenAI Style Endpoints.☆91Updated 3 weeks ago
- Experiment on QnA tabular data using LLMs and SQL☆28Updated last year
- This repository features a Gradio interface designed to leverage the OpenAI Text-To-Speech (TTS) API. The interface lets users create spe…☆14Updated 2 years ago
- Automatically generate a lip-synced avatar based off of a transcript and audio☆14Updated 2 years ago
- 🤖 Open-source LLM server (OpenAI, Ollama, Groq, Anthropic) with support for HTTP, Streaming, Agents, RAG (Deprecated check out Orchestra…☆32Updated 7 months ago
- Add real-time Speech-to-Text to your LiveKit application with AssemblyAI☆18Updated 7 months ago
- A python library to find differences between audio and transcriptions☆19Updated 2 years ago
- Chat Complex PDF with Tables Using IBM WatsonX, Langchain and LlamaParser.☆14Updated 4 months ago
- An audio processing tool for detecting and removing silence in audio recordings. Create text files for video silence removal using custom…☆25Updated 8 months ago
- Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.☆16Updated last year
- Small demos demonstrating different capabilities of LiveKit Agents☆23Updated 10 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆22Updated last year
- ☆21Updated last year
- ☆16Updated 2 years ago
- ☆13Updated last year