juice500ml / xlm_to_xlsr
Official implementation of the paper "Distilling a Pretrained Language Model to a Multilingual ASR Model" (Interspeech 2022)
☆12Updated 10 months ago
Alternatives and similar repositories for xlm_to_xlsr:
Users that are interested in xlm_to_xlsr are comparing it to the libraries listed below
- The implementation of "End-to-End Neural Speaker Diarization with an Iterative Adaptive Attractor Estimation", which is accepted by Neura…☆11Updated last year
- Official implementation of the APSIPA 2022 paper: Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Updated 2 years ago
- A toolkit dedicate for speech evaluation.☆19Updated 4 months ago
- ☆24Updated 2 weeks ago
- ☆14Updated 4 months ago
- Official implementation of paper: Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-…☆24Updated 4 months ago
- ☆38Updated 4 months ago
- Discriminative Training of VBx Diarization☆22Updated 4 months ago
- ☆10Updated 2 months ago
- ☆26Updated last year
- A neural speech codec based on discrete WavLM representations☆22Updated 5 months ago
- ☆46Updated 2 months ago
- ☆21Updated last year
- SLT 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge☆12Updated 7 months ago
- TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings☆22Updated 4 months ago
- Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction☆37Updated 2 months ago
- ☆14Updated last year
- [SLT'24] Mamba-based Decoder-Only Approach for Speech Recognition☆11Updated last month
- Alignment examples for Interspeech 2024☆18Updated 6 months ago
- A simple command line tool to calculate WER for ASR.☆14Updated 3 months ago
- This repository is the official implementation of unimodal aggregation (UMA) for automaticspeech recognition (ASR).☆21Updated last month
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆10Updated last year
- This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge.☆32Updated last year
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆18Updated 3 months ago
- ☆15Updated 6 months ago
- ☆12Updated 11 months ago
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.☆11Updated 3 weeks ago
- ☆48Updated last year
- Streaming Vocos☆19Updated 3 weeks ago
- ☆56Updated 3 months ago