idiap/bert-text-diarization-atc

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/idiap/bert-text-diarization-atc)

idiap / bert-text-diarization-atc

This is a repository for a paper accepted at the 2022 IEEE Spoken Language Technology Workshop (SLT 2022)

☆17

Alternatives and similar repositories for bert-text-diarization-atc

Users that are interested in bert-text-diarization-atc are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

idiap / w2v2-air-traffic
View on GitHub
This is a repository for a paper accepted at the 2022 IEEE Spoken Language Technology Workshop (SLT 2022)
☆42Jul 10, 2024Updated 2 years ago
idiap / atco2-corpus
View on GitHub
A Corpus for Research on Robust Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications
☆89Mar 24, 2023Updated 3 years ago
idiap / zff_vad
View on GitHub
Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering
☆23Oct 19, 2023Updated 2 years ago
mzboito / IWSLT2022_Tamasheq_data
View on GitHub
Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…
☆18Nov 30, 2022Updated 3 years ago
DFRobot / Maqueen_Plus_HuskyLens_TutorialMindplus_version_EN
View on GitHub
☆13Jun 1, 2020Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
zds-potato / multilingual-phonetic-sv
View on GitHub
☆10Dec 22, 2023Updated 2 years ago
MaiAnh871 / melpe
View on GitHub
☆10Apr 20, 2022Updated 4 years ago
aizhiqi-work / OpenKWS
View on GitHub
开源自定义唤醒词
☆17Dec 24, 2025Updated 7 months ago
kyegomez / MultiQueryAttention
View on GitHub
This is a simple torch implementation of the high performance Multi-Query Attention
☆16Aug 23, 2023Updated 2 years ago
maelfabien / EM_GMM_HMM
View on GitHub
Illustrating EM for GMMs and HMMs
☆12May 9, 2020Updated 6 years ago
idiap / contextual-biasing-on-gpus
View on GitHub
Implementation of the contextual biasing for ASR decoding on GPUs without lattice generation. The code supports submission to Interspeech…
☆21Sep 25, 2023Updated 2 years ago
HolgerBovbjerg / data2vec-KWS
View on GitHub
This repository contains code for applying Data2Vec to pretrain Keyword Transformer model as described in "Improving Label-Deficient Keyw…
☆32Mar 6, 2025Updated last year
liutaocode / AwesomeDiarizationDataset
View on GitHub
Both audio-only and audio-visual speaker diarization datasets are listed here.
☆16Feb 22, 2023Updated 3 years ago
dianwen-ng / Keyword-Spotting-ConvMixer
View on GitHub
☆33Aug 10, 2022Updated 3 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
VITA-Group / Audio-Lottery
View on GitHub
[ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…
☆32Apr 8, 2022Updated 4 years ago
mt-upc / SHAS
View on GitHub
SHAS: Approaching optimal Segmentation for End-to-End Speech Translation
☆44Feb 9, 2023Updated 3 years ago
mncosta / biogeme_tutorial
View on GitHub
Introductory tutorial to Biogeme
☆23Apr 28, 2022Updated 4 years ago
yphacker / kesci
View on GitHub
kesci
☆11Jul 12, 2019Updated 7 years ago
kamperh / vqwordseg
View on GitHub
Unsupervised phone and word segmentation using dynamic programming on self-supervised VQ features.
☆39May 5, 2026Updated 2 months ago
BUTSpeechFIT / diacorrect
View on GitHub
Error correction back-end for speaker diarization
☆18Sep 26, 2023Updated 2 years ago
sarulab-speech / jsut-label
View on GitHub
context labels and pronunciation data for JSUT corpus
☆77Sep 2, 2021Updated 4 years ago
Ydkwim / CTAL
View on GitHub
Pre-training Cross-modal Transformer for Audio-and-Language Representations
☆39Apr 20, 2021Updated 5 years ago
chentuochao / Target-Conversation-Extraction
View on GitHub
This is the code and dataset repo for Interspeech 2024 paper "Target conversation extraction: Source separation using turn-taking dynamic…
☆58Aug 15, 2025Updated 11 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
JuanPZuluaga / accent-recog-slt2022
View on GitHub
Repository for Accent Recognition (Hackathon @SLT2022)
☆43May 12, 2024Updated 2 years ago
l0b0 / xautolock
View on GitHub
This is just a copy of the original sources. I do not maintain this repository.
☆19Feb 26, 2021Updated 5 years ago
Ephrem-ETH / E2E-KWS
View on GitHub
End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM
☆45Nov 18, 2022Updated 3 years ago
mrusci / ondevice-learning-kws
View on GitHub
Test Framework for few-shot open set KWS
☆45Nov 8, 2024Updated last year
mct10 / CoBERT
View on GitHub
Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning
☆48Nov 8, 2023Updated 2 years ago
doerlbh / MiniVox
View on GitHub
Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".
☆29Sep 20, 2021Updated 4 years ago
axegon / spectrust
View on GitHub
⚠️REPO HAS BEEN MIGRATED ⚠️https://codeberg.org/axegon/spectrust
☆23Apr 20, 2020Updated 6 years ago
docugami / DFM-benchmarks
View on GitHub
Benchmarks for Business Document Foundation Models
☆10Apr 4, 2024Updated 2 years ago
GoodAI / HALLM
View on GitHub
A prototype agent with the purpose of evaluating the performance of a Large Language Model within a python terminal.
☆13Aug 28, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
archiki / ASR-Accent-Analysis
View on GitHub
Analysis and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.
☆15Jun 27, 2020Updated 6 years ago
neondatabase / neon-api-python
View on GitHub
a Python client for the Neon API
☆19Aug 23, 2025Updated 11 months ago
secretsauceai / mfcc-rust
View on GitHub
☆21Jul 23, 2023Updated 3 years ago
JayZeeDesign / reddit-replydude
View on GitHub
☆16Apr 12, 2024Updated 2 years ago
cheoljun95 / sdhubert
View on GitHub
☆27Dec 4, 2024Updated last year
Jityan / sslprotonet
View on GitHub
Code Repository for "SSL-ProtoNet: Self-supervised Learning Prototypical Networks for few-shot learning"
☆29Oct 8, 2024Updated last year
IJDykeman / simple_depth_from_motion
View on GitHub
☆30Mar 21, 2021Updated 5 years ago