nianlonggu / WhisperSeg
Code for ICASSP 2024 paper WhisperSeg: Positive Transfer of the Whisper Speech Transformer to Human and Animal Voice Activity Detection
☆38Updated 5 months ago
Alternatives and similar repositories for WhisperSeg
Users who are interested in WhisperSeg are comparing it to the libraries listed below.
- ☆93Updated last month
- Speaker change detection using SincNet and an LSTM/Transformer☆56Updated 6 months ago
- NOTSOFAR-1 Challenge: Distant Diarization and ASR☆58Updated 10 months ago
- ☆66Updated last year
- Clustering-based methods for overlapping diarization☆82Updated last year
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆92Updated 2 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated 3 years ago
- ☆69Updated last year
- Reproducible experimental protocols for multimedia (audio, video, text) databases☆108Updated 2 weeks ago
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings…☆100Updated 11 months ago
- Python package for combining diarization system outputs.☆91Updated 2 years ago
- A Python library for voice activity detection (VAD) for speech/non-speech segmentation.☆89Updated 3 years ago
- ☆73Updated 2 months ago
- Adapting a ConvNeXt model to audio classification on AudioSet☆27Updated 10 months ago
- The VoxTube dataset official repository☆71Updated last year
- Confidence interval computation for evaluation in machine learning using the bootstrapping approach☆92Updated last year
- INTERSPEECH 23 - Refunction Whisper to recognize new tasks with adapters!☆43Updated 2 years ago
- ☆35Updated last year
- Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning☆95Updated last year
- Deep Articulatory Synthesis and Inversion☆54Updated last year
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT☆55Updated 2 years ago
- This GitHub repo is for NeurIPS 2021 and Interspeech 2022 papers on Non-Matching Reference based estimation of speech quality assessment.…☆104Updated 2 years ago
- Predicts the level of noise and reverberation on your audio files☆172Updated 6 months ago
- ☆54Updated 2 years ago
- Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation…☆73Updated last year
- A simple package for Guided source separation (GSS)☆132Updated last year
- Official repository of NeXt-TDNN for speaker verification☆80Updated last year
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆107Updated 2 years ago
- This repository contains the baseline system for CHiME-8 MMCSG challenge focusing on transcribing both sides of a conversation where one …☆39Updated last year
- Python toolkit for speech processing☆72Updated last week