tuanct1997 / Federated-Learning-ASR-based-on-wav2vec-2.0View external linksLinks
☆18Mar 13, 2024Updated last year
Alternatives and similar repositories for Federated-Learning-ASR-based-on-wav2vec-2.0
Users that are interested in Federated-Learning-ASR-based-on-wav2vec-2.0 are comparing it to the libraries listed below
Sorting:
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆33Oct 23, 2025Updated 3 months ago
- The project for speech translation☆12Sep 28, 2023Updated 2 years ago
- Repository for "Training Audio Captioning Models without Audio"☆10Sep 26, 2023Updated 2 years ago
- ☆11Jun 14, 2024Updated last year
- ☆10Dec 22, 2023Updated 2 years ago
- [ICASSP 2023] FedAudio: A Federated Learning Benchmark for Audio and Speech Tasks☆51Feb 21, 2024Updated last year
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment☆12Feb 5, 2025Updated last year
- ☆14Nov 26, 2024Updated last year
- Anaouder mouezh e Brezhoneg gant Vosk☆16Nov 24, 2025Updated 2 months ago
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆13Mar 30, 2025Updated 10 months ago
- Source code and speech samples for the DSU-AVO paper accepted to INTERSPEECH 2023☆12May 13, 2024Updated last year
- ☆16Nov 9, 2023Updated 2 years ago
- ☆13Dec 7, 2022Updated 3 years ago
- Voice activity detection and speaker gender segmentation audiovisual corpus☆16Jan 20, 2025Updated last year
- ☆14Feb 9, 2023Updated 3 years ago
- ☆15Aug 25, 2022Updated 3 years ago
- Leveraging BERT to Improve Spoken Language Identification☆17Nov 22, 2022Updated 3 years ago
- [INTERSPEECH 2024] Official pytorch code for the paper "Disentangled Representation Learning for Environment-agnostic Speaker Recognition…☆18Jul 23, 2024Updated last year
- The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix(work accepted at I…☆18Feb 17, 2023Updated 2 years ago
- ☆18Mar 4, 2023Updated 2 years ago
- ☆13Sep 12, 2024Updated last year
- ☆37Jun 28, 2021Updated 4 years ago
- The project is associated with the recently-launched INTERSPEECH 2025 Workshop on Multilingual Conversational Speech Language Model (MLC-…☆49May 14, 2025Updated 9 months ago
- Unsupervised phone and word segmentation using dynamic programming on self-supervised VQ features.☆39Mar 4, 2024Updated last year
- [SLT'24] Mamba-based Decoder-Only Approach for Speech Recognition☆18Dec 1, 2024Updated last year
- ☆17Jul 22, 2024Updated last year
- [ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels☆42Mar 20, 2024Updated last year
- Visual Speech Recongnition☆19Dec 24, 2024Updated last year
- ☆21Jul 15, 2024Updated last year
- ☆24Jan 14, 2021Updated 5 years ago
- INTERSPEECH 23 - Refunction Whisper to recognize new tasks with adapters!☆43Sep 11, 2023Updated 2 years ago
- TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languages☆18May 23, 2024Updated last year
- T5-based (russian) text normalization☆25Jan 25, 2024Updated 2 years ago
- Survey on speech generation work.☆21Nov 26, 2023Updated 2 years ago
- ☆24Feb 20, 2024Updated last year
- Source code of paper <End-to-End Language Diarization for Bilingual Code-switching Speech>☆19Jan 23, 2022Updated 4 years ago
- ☆54Jul 1, 2024Updated last year
- [ICASSP 2022] Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection☆25May 18, 2023Updated 2 years ago
- ☆23Jan 6, 2023Updated 3 years ago