idiap / bert-text-diarization-atc
☆14 Updated 2 years ago
Alternatives and similar repositories for bert-text-diarization-atc:
Users who are interested in bert-text-diarization-atc are comparing it to the repositories listed below.
- Both audio-only and audio-visual speaker diarization datasets are listed here. ☆12 Updated 2 years ago
- ☆12 Updated last month
- ☆35 Updated 7 months ago
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories. ☆19 Updated 4 months ago
- ☆12 Updated 6 months ago
- ☆9 Updated 5 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech. ☆13 Updated last year
- Balanced Error Rate for Speaker Diarization ☆29 Updated 2 years ago
- ☆21 Updated last month
- ☆10 Updated 4 months ago
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task. ☆21 Updated last week
- Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations" ☆13 Updated 2 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR ☆9 Updated 2 years ago
- Speaker change detection using SincNet and an LSTM/Transformer ☆47 Updated 8 months ago
- ☆9 Updated last week
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI… ☆21 Updated 6 months ago
- This repository contains all the code necessary for running the multilingual distilwhisper from the Ferraz et al. 2024 IEEE ICASSP paper. ☆20 Updated 11 months ago
- A handy dataset of noises for ASR ☆19 Updated 5 years ago
- Official implementation of the paper "Distilling a Pretrained Language Model to a Multilingual ASR Model" (Interspeech 2022) ☆12 Updated 11 months ago
- ☆31 Updated 11 months ago
- ☆14 Updated 2 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ☆27 Updated last year
- End-to-end diarization loss ☆22 Updated 3 years ago
- Clustering-based methods for overlapping diarization ☆76 Updated last year
- ☆10 Updated 3 months ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units ☆15 Updated 5 months ago
- ☆19 Updated last year
- Open-source Mandarin biased word dataset ☆11 Updated last year
- Repository for "Training Audio Captioning Models without Audio" ☆9 Updated last year
- Rescoring methods for end-to-end Automatic Speech Recognition ☆27 Updated 4 years ago