ElmiraGhorbani / gpt-speaker-diarizationLinks
Conversational Speaker Diarization using OpenAI AI Language Models(gpt-4) and OpenAI Whisper.
☆14Updated 2 years ago
Alternatives and similar repositories for gpt-speaker-diarization
Users that are interested in gpt-speaker-diarization are comparing it to the libraries listed below
Sorting:
- 🤖 Quantum-powered excuse generator for developers. Blame bugs on cosmic rays, AI sentience, or Schrödinger’s intern.☆26Updated 3 weeks ago
- Automatically generate a lip-synced avatar based off of a transcript and audio☆13Updated 2 years ago
- AI Lip Syncing application, deployed on Streamlit☆43Updated last year
- This repository features a Gradio interface designed to leverage the OpenAI Text-To-Speech (TTS) API. The interface lets users create spe…☆14Updated last year
- ☆17Updated last year
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Updated last week
- A proposed GPT chatbot for teachers that uses retrieval-augmentation to answer questions about their students.☆10Updated 9 months ago
- Small demos demonstrating different capabilities of LiveKit Agents☆18Updated 6 months ago
- ASR + diarization model server with speculative decoding☆62Updated last year
- DoyenTalker uses deep learning techniques to generate personalized avatar videos that speak user-provided text in a specified voice. The …☆13Updated last year
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.☆74Updated last week
- Engage in conversation with your virtual self using AI techniques like NLP, voice cloning, and computer vision. Get accurate answers with…☆85Updated 2 years ago
- Retrieval Augmented Generation for youtube videos with a BRAD agent☆33Updated 7 months ago
- The UnisonAI Multi-Agent Framework (A2A) provides a flexible and extensible environment for creating and coordinating multiple autonomous…☆21Updated 2 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆68Updated 2 weeks ago
- Auto Data is a library designed for quick and effortless creation of datasets tailored for fine-tuning Large Language Models (LLMs).☆102Updated 10 months ago
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆136Updated last year
- ☆17Updated 11 months ago
- Transcription and diarization (speaker identification)☆33Updated 2 years ago
- This project provides a Flask-based API for generating high-quality text-to-speech (TTS) audio using F5-TTS, a flexible and powerful TTS …☆14Updated last month
- Run different pipelines of WhisperX - Transcription, Diarization, VAD, Alignment completely OFFLINE.☆41Updated 5 months ago
- A python library to find differences between audio and transcriptions☆20Updated last year
- Harness the power of NVIDIA technologies and LangChain to create dynamic avatars from live speech, integrating RIVA ASR and TTS with Audi…☆89Updated last year
- Imagine translating your speech or anybody's speech to any language you want within minutes. check this out...☆48Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆97Updated last year
- ☆21Updated 10 months ago
- Get started using Deepgram's Live Transcription with this Flask demo app☆40Updated last month
- Talking Face Generation system☆19Updated last year
- A library for real-time Speech to Text (STT), and Text to Speech (TTS) capability☆44Updated last year
- AI Talking Head: create video from plain text or audio file in minutes, support up to 100+ languages and 350+ voice models.☆36Updated 2 years ago