ElmiraGhorbani / gpt-speaker-diarization
Conversational speaker diarization using OpenAI language models (GPT-4) and OpenAI Whisper.
☆13 Updated last year
Alternatives and similar repositories for gpt-speaker-diarization:
Users who are interested in gpt-speaker-diarization are comparing it to the libraries listed below.
- Transcription and diarization (speaker identification) ☆34 Updated last year
- "The-Rasa-Answer-Machine-GPT3" is an advanced chatbot equipped to answer questions and offer useful info. Constructed with Rasa & GPT-3, … ☆25 Updated 2 years ago
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback… ☆10 Updated 6 months ago
- ☆12 Updated last year
- A minimalistic automatic speech recognition Streamlit-based web app powered by OpenAI's Whisper "State of the Art" models ☆66 Updated 2 years ago
- Run different pipelines of WhisperX (transcription, diarization, VAD, alignment) completely offline. ☆38 Updated 3 weeks ago
- Program that enables seamless interaction with your documents through an advanced vector database and the power of Large Language Model (… ☆17 Updated last year
- ☆16 Updated last year
- A proposed GPT chatbot for teachers that uses retrieval-augmentation to answer questions about their students. ☆10 Updated 4 months ago
- ☆39 Updated 11 months ago
- This master's thesis project is based on OpenAI Whisper, with the goal of transcribing interviews ☆47 Updated 8 months ago
- The ChatGPT Long Term Memory package is a powerful tool designed to empower your projects with the ability to handle a large number of si… ☆56 Updated last year
- This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very … ☆35 Updated 4 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆94 Updated 11 months ago
- This project provides a Flask-based API for generating high-quality text-to-speech (TTS) audio using F5-TTS, a flexible and powerful TTS … ☆12 Updated last month
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆62 Updated 2 weeks ago
- Open Server is an OpenAI API Compatible Server for generating text, images, embeddings, and storing them in vector databases. It also inc… ☆16 Updated last year
- Generate transcriptions and subtitles using OpenAI Whisper as a base model, stable-ts/whisperx as a timestamp stabilizer using ASR models… ☆18 Updated 2 years ago
- Canvas-based talking head model using viseme data ☆31 Updated last year
- Experiment for creating a safe companion chatbot (according to OpenAI rules) ☆13 Updated 2 years ago
- ☆16 Updated last year
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution. ☆53 Updated 4 months ago
- Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support. ☆14 Updated last year
- A lightweight Python library for running TTS models with a unified API. ☆18 Updated 2 months ago
- A testing repo to share code and thoughts on diarisation ☆55 Updated last year
- Convert a saved PyTorch model to GGUF and generate as much corresponding ggml C code as possible ☆14 Updated last year
- Implementation of SoundStream from the paper: "SoundStream: An End-to-End Neural Audio Codec" ☆12 Updated 3 months ago
- ☆11 Updated 9 months ago
- On-device speaker diarization powered by deep learning ☆44 Updated last month
- SadTalker gradio_demo.py file with code section that allows you to set the eye blink and pose reference videos for the software to use wh… ☆11 Updated last year