[NeurIPS 2022] "Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Speech Processing" by Yonggan Fu, Yang Zhang, Kaizhi Qian, Zhifan Ye, Zhongzhi Yu, Cheng-I Lai, Yingyan Lin
☆17Sep 19, 2023Updated 2 years ago
Alternatives and similar repositories for S3-Router
Users that are interested in S3-Router are comparing it to the libraries listed below
Sorting:
- The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix(work accepted at I…☆18Feb 17, 2023Updated 3 years ago
- ☆31Dec 2, 2020Updated 5 years ago
- ☆14Mar 25, 2023Updated 2 years ago
- [ECCV 2022] SuperTickets: Drawing Task-Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning☆20Jul 7, 2022Updated 3 years ago
- ☆19Apr 28, 2023Updated 2 years ago
- Official Implementation of "Inference and Denoise: Causal Inference-based Neural Speech Enhancement"☆29Feb 26, 2023Updated 3 years ago
- Official PyTorch implementation of the paper "Robust Training for Speaker Verification against Noisy Labels" in INTERSPEECH 2023.☆11Oct 23, 2023Updated 2 years ago
- 🎹 pyannote + 🗒 notebook = pyannotebook☆26Jun 12, 2023Updated 2 years ago
- An awesome spoken LID repository. (Working in progress☆108Apr 22, 2024Updated last year
- Official Codebase of "A Closer Look at Weakly-Supervised Audio-Visual Source Localization" (NeurIPS 2022)☆20Dec 6, 2022Updated 3 years ago
- Layer-wise analysis of self-supervised pre-trained speech representations☆127Oct 18, 2024Updated last year
- Learning Domain-Invariant Transformation for Speaker Verification.☆11Jun 13, 2023Updated 2 years ago
- Run commands on remote hosts, inspecting key indicators to manage infrastructure☆15Jan 29, 2026Updated last month
- The official code for [ECCV2020] "HALO: Hardware-aware Learning to Optimize"☆10Mar 22, 2023Updated 2 years ago
- Clustering-based methods for overlapping diarization☆82Jan 12, 2024Updated 2 years ago
- Official Code implementation for the ICLR paper "LipVoicer: Generating Speech from Silent Videos Guided by Lip Reading"☆85Sep 19, 2024Updated last year
- Scripts to prepare OXFORD VGG Face dataset☆12Mar 29, 2016Updated 9 years ago
- This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl…☆78Oct 18, 2022Updated 3 years ago
- Official implementation for AVGN☆40Mar 24, 2023Updated 2 years ago
- ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'☆44Oct 31, 2022Updated 3 years ago
- Dynamic vision-guided speaker embedding for audio-visual speaker diarization☆12Jul 5, 2022Updated 3 years ago
- An open-sourced PyTorch library for developing energy efficient multiplication-less models and applications.☆14Feb 3, 2025Updated last year
- ☆11Sep 4, 2023Updated 2 years ago
- ☆37Mar 30, 2021Updated 4 years ago
- ☆93Apr 24, 2025Updated 10 months ago
- A fast python library for aligning similar audio snippets passed in as NumPy arrays☆48Oct 27, 2025Updated 4 months ago
- ☆11Apr 3, 2023Updated 2 years ago
- Official implement of "Dual-stream Time-Delay Neural Network with Dynamic Global Filter for Speaker Verification" in PyTorch☆41Aug 31, 2023Updated 2 years ago
- VoxLingua107 recipe for SpeechBrain☆13Jul 3, 2021Updated 4 years ago
- MagicData-RAMC Dataset and Baseline☆58Sep 13, 2022Updated 3 years ago
- Unsupervised phone and word segmentation using dynamic programming on self-supervised VQ features.☆39Mar 4, 2024Updated 2 years ago
- ☆11May 4, 2020Updated 5 years ago
- Target Agnostic Attack on Deep Models: Exploiting Security Vulnerabilities of Transfer Learning☆10Jul 2, 2019Updated 6 years ago
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Nov 18, 2022Updated 3 years ago
- [ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati…☆45Jun 30, 2024Updated last year
- [ICML 2022] ShiftAddNAS: Hardware-Inspired Search for More Accurate and Efficient Neural Networks☆15May 18, 2022Updated 3 years ago
- Codes and datasets for our ICASSP2023 paper, Evaluating parameter-efficient transfer learning approaches on SURE benchmark for speech und…☆42Mar 12, 2023Updated 3 years ago
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition☆18Jul 16, 2024Updated last year
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT☆41Aug 29, 2024Updated last year