er537 / whisper_interpretability
A repo to do interpretability of pre-trained acoustic models
☆15 Updated 2 years ago
Alternatives and similar repositories for whisper_interpretability
Users that are interested in whisper_interpretability are comparing it to the libraries listed below
- This repo contains the official PyTorch implementation of "Analyzing Discrete Self Supervised Speech Representation For Spoken Language M… ☆20 Updated 3 years ago
- ☆23 Updated 2 years ago
- JAX Implementations of Descript Audio Codec and EnCodec ☆33 Updated 10 months ago
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories. ☆27 Updated 10 months ago
- ☆67 Updated last year
- ☆24 Updated last year
- Experimental playground for benchmarking language model (LM) architectures, layers, and tricks on smaller datasets. Designed for flexible… ☆98 Updated 2 weeks ago
- ☆86 Updated last year
- Transcribing Speech with Multinomial Diffusion, training code and models. ☆80 Updated 2 years ago
- A home for audio ML in JAX. Has common features, learnable frontends, pretrained supervised and self-supervised models. ☆70 Updated 3 years ago
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI… ☆24 Updated 4 months ago
- Code for the method proposed in the paper: ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres… ☆23 Updated last year
- ☆56 Updated 3 years ago
- A spoken version of the textual story cloze benchmark ☆20 Updated 2 years ago
- The official code for the SALMon🍣 benchmark (ICASSP 2025 - Oral) ☆48 Updated 5 months ago
- Implementation of DiffWave and SaShiMi audio generation models ☆128 Updated 2 years ago
- Official code for Wav2Seq ☆97 Updated 3 years ago
- ☆32 Updated last month
- Fast and differentiable hidden Markov model in C++ ☆19 Updated 3 years ago
- Framework for writing deep learning training loops. Lightweight, and retaining full freedom to design as you see fit. It handles checkpo… ☆118 Updated last year
- Audiogen Codec ☆144 Updated last year
- ☆21 Updated 2 years ago
- Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax. ☆38 Updated 2 years ago
- Understanding and Tackling Hallucinations in Large Audio-Language Models | ICASSP 2025, Interspeech 2024 ☆32 Updated 10 months ago
- Viterbi decoding in PyTorch ☆40 Updated 5 months ago
- Interface Design for Self-Supervised Speech Models, accepted to Interspeech 2024 ☆16 Updated last year
- Official implementation of MelHuBERT ☆68 Updated last year
- Suite for phonetic word embeddings, especially their evaluation and baseline models. ☆36 Updated 11 months ago
- GitHub repository for ACL 2025 paper: VoxEval: Benchmarking the Knowledge Understanding Capabilities of End-to-End Spoken Language Models ☆24 Updated 7 months ago
- Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models ☆59 Updated 7 months ago