ElmiraGhorbani / gpt-speaker-diarization
Conversational Speaker Diarization using OpenAI AI Language Models(gpt-4) and OpenAI Whisper.
☆12Updated last year
Alternatives and similar repositories for gpt-speaker-diarization:
Users that are interested in gpt-speaker-diarization are comparing it to the libraries listed below
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Updated 5 months ago
- "The-Rasa-Answer-Machine-GPT3" is an advanced chatbot equipped to answer questions and offer useful info. Constructed with Rasa & GPT-3, …☆24Updated 2 years ago
- A python library to find differences between audio and transcriptions☆18Updated last year
- The ChatGPT Long Term Memory package is a powerful tool designed to empower your projects with the ability to handle a large number of si…☆56Updated last year
- Transcription and diarization (speaker identification)☆31Updated last year
- ☆16Updated last year
- This package is the Python implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid…☆19Updated 5 months ago
- Query, ask and chat with a document-index via transformer models!☆17Updated last year
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆53Updated 3 months ago
- Efficient approach to speaker diarization using voice characteristics extraction☆93Updated 11 months ago
- ☆11Updated 8 months ago
- Leveraging OpenAI's Whisper ASR and GPT-4 models to automate the process of generating meeting minutes from audio recordings, as well as …☆18Updated last year
- Talking Face Generation system☆19Updated last year
- Imagine translating your speech or anybody's speech to any language you want within minutes. check this out...☆42Updated 7 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆94Updated 10 months ago
- this master thesis project is based on OpenAI Whisper with the goal to transcibe interviews☆47Updated 7 months ago
- A curated list of awesome OpenAI's Whisper☆99Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆61Updated 3 weeks ago
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆126Updated 9 months ago
- SadTalker gradio_demo.py file with code section that allows you to set the eye blink and pose reference videos for the software to use wh…☆11Updated last year
- Simple playground chat app that interacts with OpenAI's functions with memory and custom tools.☆18Updated last year
- Text To Speech Multilingual Support (+20 Language)☆42Updated last year
- This repository includes the code to download the curated HuggingFace papers into a single markdown formatted file☆14Updated 8 months ago
- Speech To Speech: an effort for an open-sourced and modular GPT4-o☆51Updated 5 months ago
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.☆33Updated this week
- GUI to sync video mouth movements to match audio, utilizing wav2lip-hq. Completed as part of a technical interview.☆11Updated 10 months ago
- LipSync AI is your ultimate solution for flawless lip-syncing in videos. Our AI model precisely synchronizes audio and video, creating li…☆9Updated last year
- ☆16Updated last year
- Implementation of SoundtStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"☆12Updated 2 months ago
- ☆12Updated last year