calclavia / tal-asrd
Code for the Paper Speech Recognition and Multi-Speaker Diarization of Long Conversations
☆38Updated last year
Alternatives and similar repositories for tal-asrd:
Users that are interested in tal-asrd are comparing it to the libraries listed below
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆59Updated 4 years ago
- Clustering-based methods for overlapping diarization☆81Updated last year
- Balanced Error Rate for Speaker Diarization☆32Updated 2 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆75Updated 3 years ago
- Phoneme segmentation using pre-trained speech models☆55Updated 2 years ago
- ☆56Updated last month
- Python toolkit for speech processing☆68Updated last month
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT☆53Updated 2 years ago
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.☆22Updated 2 months ago
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆31Updated 3 years ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆90Updated 3 years ago
- Discriminative Condition-Aware PLDA☆43Updated 9 months ago
- Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2020☆42Updated 4 years ago
- The VoxTube dataset official repository☆68Updated last year
- A PyTorch implementation of End-to-End Neural Diarization☆109Updated last year
- ☆59Updated 4 years ago
- Multipurpose Multi Speaker Mixture Signal Generator☆44Updated 3 months ago
- LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT☆73Updated 2 years ago
- A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK☆60Updated 3 years ago
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆39Updated 4 years ago
- This Repository surveys the paper focusing on Prompting and Adapters for Speech Processing.☆108Updated last year
- A simple package for Guided source separation (GSS)☆121Updated 11 months ago
- NOTSOFAR-1 Challenge: Distant Diarization and ASR☆52Updated 2 months ago
- Alignment files of LibriTTS.☆61Updated 5 years ago
- ☆30Updated last year
- Code and data recipes for the paper: Heterogeneous Target Speech Separation☆41Updated 2 years ago
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…☆47Updated 4 months ago
- experiments about AudioSet☆44Updated last year
- Transformer-based online speech recognition system with TensorFlow 2☆26Updated 4 years ago
- Pytorch implementation of subband decomposition☆92Updated 2 years ago