bofenghuang / community-events
Place where folks can contribute to π€ community events
β9Updated last year
Alternatives and similar repositories for community-events:
Users that are interested in community-events are comparing it to the libraries listed below
- Real-time Speech Separation, Noise Suppression & Speaker Recognitionβ17Updated 5 years ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.β78Updated last year
- Speaker change detection using SincNet and an LSTM/Transformerβ46Updated 7 months ago
- This project is about performing Speaker diarization for Hindi Language.β48Updated 3 years ago
- Clustering-based methods for overlapping diarizationβ74Updated last year
- Online streaming speaker change detection model in Pytorchβ37Updated last year
- β70Updated last year
- β34Updated 4 months ago
- This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"β115Updated 2 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdfβ64Updated 3 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) databaseβ94Updated 2 weeks ago
- β41Updated 2 years ago
- β38Updated last year
- pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paperβ21Updated 2 years ago
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Rivaβ83Updated last month
- This repository contains the training, inference, evaluation code for SpeechLLM models and details about the model releases on huggingfacβ¦β79Updated 7 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisperβ107Updated last year
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.β15Updated 2 months ago
- The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based β¦β109Updated last week
- Predicts the level of noise and reverberation on your audiofilesβ144Updated 8 months ago
- Audio Preprocessing and finetuning of wav2vec2-large-xlsr model on AI4D Baamtu Datamation - Automatic Speech Recognition in WOLOF Data.β17Updated 3 years ago
- β29Updated 2 years ago
- INTERSPEECH 23 - Refunction Whisper to recognize new tasks with adapters!β33Updated last year
- A list of papers for child ASRβ35Updated 3 months ago
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTMβ39Updated 2 years ago
- GSoC'2021 | TensorFlow implementation of Wav2Vec2β90Updated 3 years ago
- β56Updated 2 years ago
- A curated list of speaker-embedding speaker-verification, speaker-identification resources.β46Updated 3 years ago
- Various speech datasets made available to the publicβ110Updated last month
- Repository containing experimentation platform on how to train, infer on wav2vec2 models.β86Updated 2 years ago