THUsatlab / BERT-LIDLinks
Leveraging BERT to Improve Spoken Language Identification
☆16Updated 2 years ago
Alternatives and similar repositories for BERT-LID
Users that are interested in BERT-LID are comparing it to the libraries listed below
Sorting:
- The baseline system for the ICASSP2024 ICMC-ASR Challenge.☆53Updated last year
- ☆152Updated 2 years ago
- ☆13Updated 3 years ago
- The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to pro…☆129Updated 3 years ago
- [INTERSPEECH 2023] Knowledge Transfer from Pre-trained Language Models to Cif-based Recognizers via Hierarchical Distillation☆40Updated 2 years ago
- ☆88Updated 5 months ago
- MagicData-RAMC Dataset and Baseline☆55Updated 3 years ago
- SpEx+(tied) source code☆88Updated 2 years ago
- Baseline system for CNVSRC2023 (Chinese Continuous Visual Speech Recognition Challenge 2023)☆22Updated last year
- ☆33Updated 3 years ago
- This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl…☆76Updated 2 years ago
- Implementation of Hybrid CTC/Attention Architecture for End-to-End Speech Recognition in pure python and PyTorch☆26Updated last year
- ☆50Updated 4 years ago
- ☆129Updated 4 years ago
- Official PyTorch code for Deep Audio-Signal Holistic Embeddings☆160Updated 5 months ago
- ☆43Updated 2 years ago
- ☆37Updated 4 years ago
- A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"☆58Updated last year
- ☆11Updated 11 months ago
- COG-MHEAR Audio-Visual Speech Enhancement Challenge☆43Updated 5 months ago
- ☆30Updated 3 years ago
- multi-scale time domain speaker extraction☆65Updated 4 years ago
- CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence ar…☆79Updated 3 months ago
- ☆39Updated 10 months ago
- Code for calculate DNS_MOS.☆41Updated 2 years ago
- Code for paper "Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition"☆42Updated 2 years ago
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆21Updated 2 years ago
- The project is associated with the recently-launched INTERSPEECH 2025 Workshop on Multilingual Conversational Speech Language Model (MLC-…☆45Updated 4 months ago
- Official implementation of the ICASSP 2024 paper: Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Speaker Verificat…☆16Updated last year
- ☆57Updated 2 years ago