GeorgeEfstathiadis / LLM-Diarize-ASR-Agnostic
Repository for "LLM-based speaker diarization correction: A generalizable approach" paper
☆20 · Jul 31, 2024 · Updated last year
Alternatives and similar repositories for LLM-Diarize-ASR-Agnostic
Users who are interested in LLM-Diarize-ASR-Agnostic are comparing it to the libraries listed below.
- ☆11 · Oct 24, 2022 · Updated 3 years ago
- Prepare spectrograms from audio for training a Riffusion model ☆16 · Mar 6, 2023 · Updated 2 years ago
- ☆17 · May 5, 2024 · Updated last year
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition ☆18 · Jul 16, 2024 · Updated last year
- Error correction back-end for speaker diarization ☆18 · Sep 26, 2023 · Updated 2 years ago
- ☆17 · Jul 22, 2024 · Updated last year
- MeetEval - A meeting transcription evaluation toolkit ☆141 · Jan 27, 2026 · Updated 2 weeks ago
- [ICASSP 2022] AISHELL-NER: Named Entity Recognition from Chinese Speech ☆25 · Apr 20, 2022 · Updated 3 years ago
- Emotion based music recommender system ☆11 · Mar 26, 2025 · Updated 10 months ago
- NDIToolbox is an open source extensible signal and image processing application under development by TRI/Austin designed to assist with t… ☆10 · Aug 19, 2018 · Updated 7 years ago
- The implementation codes of paper: Multimodal Sentiment Analysis with Mutual Information-based Disentangled Representation Learning ☆18 · May 8, 2025 · Updated 9 months ago
- ☆10 · Aug 3, 2019 · Updated 6 years ago
- The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization" ☆11 · May 16, 2023 · Updated 2 years ago
- The implementation for "Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions" ☆50 · Apr 7, 2025 · Updated 10 months ago
- This is an official implementation in PyTorch of PTH-Net: Dynamic Facial Expression Recognition without Face Detection and Alignment. ☆13 · Jul 1, 2025 · Updated 7 months ago
- Bias Tests for Voice Technologies (bt4vt) ☆11 · Jun 16, 2024 · Updated last year
- Rivet plugin to access E2B goodies ☆10 · Feb 6, 2025 · Updated last year
- Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations. ☆12 · Feb 11, 2024 · Updated 2 years ago
- ☆10 · Oct 16, 2025 · Updated 3 months ago
- Awesome Multimodal Fusion in Speech Emotion Recognition ☆13 · Nov 11, 2025 · Updated 3 months ago
- Neural architecture search framework based on reinforcement learning: "A Novel Approach to Detecting Muscle Fatigue Based on sEMG by Using… ☆14 · Nov 22, 2024 · Updated last year
- A lecture summarization tool that uses AI and computer vision to summarize and index videos ☆11 · Dec 8, 2022 · Updated 3 years ago
- ☆10 · Dec 10, 2021 · Updated 4 years ago
- A jekyll template for easy creation of course websites. Checkout the template here: ☆11 · Aug 1, 2024 · Updated last year
- A knowledge graph based forward chain inferencing engine in typescript/node. ☆11 · Jan 23, 2021 · Updated 5 years ago
- Guide to Installing Ragflow on Google Cloud Compute Engine ☆13 · Sep 12, 2024 · Updated last year
- This project aims to utilize Generative AI for the next marketing strategy in the case of e-commerce customer segmentation. ☆12 · Mar 19, 2024 · Updated last year
- Code for paper "Cross-Domain Slot Filling as Machine Reading Comprehension" in IJCAI 2021 ☆11 · Aug 24, 2021 · Updated 4 years ago
- Multimodal SER Model meant to be trained on recognising emotions from speech (text + acoustic data). Fine-tuned the DeBERTaV3 model, resp… ☆11 · Jun 19, 2024 · Updated last year
- ☆10 · Jul 16, 2024 · Updated last year
- Huggingface Backup - Jupyter, Colab and Python Script ☆10 · Jan 20, 2026 · Updated 3 weeks ago
- Hybrid GAN (HiFi-WaveGAN) applied to footsteps sound effects ☆12 · Jul 17, 2023 · Updated 2 years ago
- Code repository for TIDMAD: Time series Dataset for Discovering Dark Matter with AI Denoising. ☆14 · Oct 23, 2025 · Updated 3 months ago
- We present a study of a neural network based method for speech emotion recognition, using audio-only features. In the studied scheme, the… ☆11 · Jul 24, 2024 · Updated last year
- This repository contains code for classification of sound using spectrograms. We train a CNN to classify the sounds after converting to s… ☆10 · Dec 14, 2018 · Updated 7 years ago
- A Next.js chat app to use Llama 2 locally using node-llama-cpp ☆12 · Oct 27, 2024 · Updated last year
- ☆10 · Jan 18, 2024 · Updated 2 years ago
- Comparing Audio Features for Unsupervised Sound Classification ☆10 · Jun 22, 2022 · Updated 3 years ago
- Encode an image to sound (WAV file) and view it as a spectrogram. Optimized Python 3 version. ☆11 · Jan 25, 2023 · Updated 3 years ago