muskang48 / Speaker-DiarizationView external linksLinks
This project is about performing Speaker diarization for Hindi Language.
☆59Mar 21, 2021Updated 4 years ago
Alternatives and similar repositories for Speaker-Diarization
Users that are interested in Speaker-Diarization are comparing it to the libraries listed below
Sorting:
- Python3 code for the IEEE SPL paper "Auto-Tuning Spectral Clustering for SpeakerDiarization Using Normalized Maximum Eigengap"☆11Apr 6, 2020Updated 5 years ago
- Target speaker automatic speech recognition (TS-ASR)☆12Oct 14, 2023Updated 2 years ago
- 基于PC-DDSP和nsf-HiFiGAN的声码器☆18Jul 17, 2023Updated 2 years ago
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment☆12Feb 5, 2025Updated last year
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.☆546Sep 25, 2024Updated last year
- ☆12Feb 9, 2021Updated 5 years ago
- This is the experimental description of MnTTS2.☆11Apr 11, 2024Updated last year
- This repository contains code for applying Data2Vec to pretrain Keyword Transformer model as described in "Improving Label-Deficient Keyw…☆30Mar 6, 2025Updated 11 months ago
- The Official PyTorch Implementation of "Mel-McNet: A Mel-Scale Framework for Online Multichannel Speech Enhancement" [Interspeech 2025]☆21Jun 9, 2025Updated 8 months ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Jun 2, 2023Updated 2 years ago
- speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition☆498Jul 1, 2021Updated 4 years ago
- ☆36Jan 6, 2026Updated last month
- [INTERSPEECH 2024] Official pytorch code for the paper "Disentangled Representation Learning for Environment-agnostic Speaker Recognition…☆18Jul 23, 2024Updated last year
- Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder☆31Aug 30, 2025Updated 5 months ago
- A text-to-audio model for generating text-conditioned drum beats☆19Apr 25, 2023Updated 2 years ago
- proof of concept conversation orchestrator with a speech-language model☆20Oct 19, 2024Updated last year
- ☆14Jul 24, 2025Updated 6 months ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆16Mar 28, 2023Updated 2 years ago
- A toolkit for researchers in the multimodal sound separation.☆16Oct 20, 2023Updated 2 years ago
- PyTorch implementation of RPNSD☆60Jun 17, 2024Updated last year
- LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM☆18May 17, 2024Updated last year
- ☆21Jul 29, 2024Updated last year
- The official implementation of DMEL the method presented in the paper "DMEL: The differentiable log-Mel spectrogram as a trainable layer …☆22Dec 21, 2024Updated last year
- Spot the conversation: speaker diarisation in the wild☆157Jul 26, 2022Updated 3 years ago
- PodcastMix A dataset for separating music and speech in podcasts.☆44Aug 20, 2024Updated last year
- Streaming Audio Models Examples in JS☆19Mar 29, 2024Updated last year
- Official Repository for "Efficient Vocal Source Separation Through Windowed RoFormer"☆42Oct 30, 2025Updated 3 months ago
- ☆21Mar 7, 2023Updated 2 years ago
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a…☆43Nov 10, 2021Updated 4 years ago
- ☆22Jun 30, 2021Updated 4 years ago
- Multilingual and code-switching ASR challenges for low resource Indian languages.☆21Jul 26, 2021Updated 4 years ago
- [CVPR 2025] Official implementation of paper "Prosody-Enhanced Acoustic Pre-training and Acoustic-Disentangled Prosody Adapting for Movie…☆23Jun 6, 2025Updated 8 months ago
- ☆18May 4, 2025Updated 9 months ago
- Bilingual Singing Voice Synthesis☆18Mar 25, 2024Updated last year
- This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"☆126Apr 8, 2022Updated 3 years ago
- This is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS.☆87Nov 12, 2024Updated last year
- ☆20Sep 2, 2024Updated last year
- A TensorFlow Implementation of Punctuation Restoration.☆18Nov 9, 2020Updated 5 years ago