Repository for "LLM-based speaker diarization correction: A generalizable approach" paper
☆20 · Jul 31, 2024 · Updated last year
Alternatives and similar repositories for LLM-Diarize-ASR-Agnostic
Users who are interested in LLM-Diarize-ASR-Agnostic are comparing it to the libraries listed below
- ☆11 · Oct 24, 2022 · Updated 3 years ago
- Prepare spectrograms from audio for training a Riffusion model ☆16 · Mar 6, 2023 · Updated 3 years ago
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition ☆18 · Jul 16, 2024 · Updated last year
- ☆17 · May 5, 2024 · Updated last year
- ☆17 · Jul 22, 2024 · Updated last year
- CDER (Conversational Diarization Error Rate) Scoring Tool ☆22 · Sep 13, 2022 · Updated 3 years ago
- Find out why your CoreML model isn't running on the Neural Engine! ☆30 · Jun 18, 2024 · Updated last year
- A VST plugin for Riffusion ☆28 · Feb 1, 2023 · Updated 3 years ago
- MeetEval - A meeting transcription evaluation toolkit ☆143 · Jan 27, 2026 · Updated last month
- ☆33 · May 16, 2023 · Updated 2 years ago
- The implementation codes of paper: Multimodal Sentiment Analysis with Mutual Information-based Disentangled Representation Learning ☆18 · May 8, 2025 · Updated 10 months ago
- Emotion based music recommender system ☆11 · Mar 26, 2025 · Updated 11 months ago
- NDIToolbox is an open source extensible signal and image processing application under development by TRI/Austin designed to assist with t… ☆10 · Aug 19, 2018 · Updated 7 years ago
- ☆10 · Aug 3, 2019 · Updated 6 years ago
- Your AI pair programmer's memory, synced to Obsidian ☆33 · Feb 2, 2026 · Updated last month
- The implementation for "Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions" ☆50 · Apr 7, 2025 · Updated 11 months ago
- Gradio chat interface for FastMLX ☆12 · Sep 22, 2024 · Updated last year
- We present a study of a neural network based method for speech emotion recognition, using audio-only features. In the studied scheme, the… ☆11 · Jul 24, 2024 · Updated last year
- ☆10 · Jan 18, 2024 · Updated 2 years ago
- This repository contains code for classification of sound using spectrograms. We train a CNN to classify the sounds after converting to s… ☆10 · Dec 14, 2018 · Updated 7 years ago
- One command to start a streaming ASR server. ☆12 · Oct 2, 2024 · Updated last year
- Guide to Installing Ragflow on Google Cloud Compute Engine ☆13 · Sep 12, 2024 · Updated last year
- Generation tool for offset-resistant audio adversarial examples against Deepspeech ☆10 · Oct 5, 2020 · Updated 5 years ago
- Comparing Audio Features for Unsupervised Sound Classification ☆10 · Jun 22, 2022 · Updated 3 years ago
- A jekyll template for easy creation of course websites. Checkout the template here: ☆11 · Aug 1, 2024 · Updated last year
- Awesome Multimodal Fusion in Speech Emotion Recognition ☆13 · Nov 11, 2025 · Updated 3 months ago
- ☆10 · Jul 16, 2024 · Updated last year
- ☆12 · Feb 16, 2026 · Updated 3 weeks ago
- A Next.js chat app to use Llama 2 locally using node-llama-cpp ☆12 · Oct 27, 2024 · Updated last year
- ☆11 · Aug 26, 2024 · Updated last year
- Rivet plugin to access E2B goodies ☆10 · Feb 6, 2025 · Updated last year
- Transform audio files into mel spectrograms for text-to-speech model training ☆12 · Aug 25, 2021 · Updated 4 years ago
- Huggingface Backup - Jupyter, Colab and Python Script ☆10 · Jan 20, 2026 · Updated last month
- A knowledge graph based forward chain inferencing engine in typescript/node. ☆11 · Jan 23, 2021 · Updated 5 years ago
- Neural architecture search framework based on reinforcement learning: "A Novel Approach to Detecting Muscle Fatigue Based on sEMG by Using… ☆15 · Nov 22, 2024 · Updated last year
- Code for paper "Cross-Domain Slot Filling as Machine Reading Comprehension" in IJCAI 2021 ☆11 · Aug 24, 2021 · Updated 4 years ago
- Write your next novel faster and easier ☆15 · Dec 7, 2025 · Updated 3 months ago
- Encode an image to sound (WAV file) and view it as a spectrogram. Optimized Python 3 version. ☆11 · Jan 25, 2023 · Updated 3 years ago
- MMER ☆14 · Jan 8, 2026 · Updated 2 months ago