er537 / whisper_interpretabilityLinks
A repo to do interpretability of pre-trained acoustic models
☆15Updated 2 years ago
Alternatives and similar repositories for whisper_interpretability
Users that are interested in whisper_interpretability are comparing it to the libraries listed below
Sorting:
- ☆23Updated 2 years ago
- JAX Implementations of Descript Audio Codec and EnCodec☆31Updated 7 months ago
- This repo contains the official PyTorch implementation of "Analyzing Discrete Self Supervised Speech Representation For Spoken Language M…☆19Updated 2 years ago
- ☆85Updated last year
- ☆56Updated 2 years ago
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆27Updated 7 months ago
- Transcribing Speech with Multinomial Diffusion, training code and models.☆80Updated 2 years ago
- Experimental playground for benchmarking language model (LM) architectures, layers, and tricks on smaller datasets. Designed for flexible…☆84Updated 3 weeks ago
- Official code for Wav2Seq☆96Updated 3 years ago
- ☆24Updated last year
- Fast and differentiable hidden Markov model in C++☆17Updated 2 years ago
- Understanding and Tackling Hallucinations in Large Audio-Language Models | ICASSP 2025, Interspeech 2024☆30Updated 7 months ago
- Audio tokenization, in the fastest way possible!☆53Updated last year
- SA-toolkit: Speaker speech anonymization toolkit in python☆28Updated last month
- Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres…☆21Updated last year
- Collection of scripts from mHuBERT-147.☆32Updated 11 months ago
- ☆66Updated last year
- A spoken version of the textual story cloze benchmark☆19Updated 2 years ago
- The official code for the SALMon🍣 benchmark (ICASSP 2025 - Oral)☆47Updated 2 months ago
- Official implementation of MelHuBERT☆68Updated last year
- Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.☆37Updated 2 years ago
- ☆31Updated 7 months ago
- A home for audio ML in JAX. Has common features, learnable frontends, pretrained supervised and self-supervised models.☆69Updated 3 years ago
- Audiogen Codec☆143Updated last year
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model☆34Updated 2 years ago
- ☆20Updated 2 years ago
- This repository contains the SpeechBrain Benchmarks☆128Updated 3 months ago
- Code for the paper: How Much Context Does My Attention-Based ASR System Need?☆11Updated 5 months ago
- ASR text preprocessing utility☆21Updated last year
- Code for the paper: GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities☆144Updated 11 months ago