pyf98/speech-model-compression

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/pyf98/speech-model-compression)

pyf98 / speech-model-compression

A collection of papers related to speech model compression

☆27

Alternatives and similar repositories for speech-model-compression

Users that are interested in speech-model-compression are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

pyf98 / DPHuBERT
View on GitHub
INTERSPEECH 2023: "DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models"
☆118Jan 26, 2024Updated 2 years ago
liyunlongaaa / AD-TUNING
View on GitHub
AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in th…
☆11Feb 23, 2024Updated 2 years ago
wooseok-shin / MetricGAN-OKD
View on GitHub
Official PyTorch implementation of "Multi-Metric Optimization of MetricGAN via Online Knowledge Distillation for Speech Enhancement" (ICM…
☆30Mar 18, 2025Updated last year
echocatzh / Demo-of-DeepComplexAEC
View on GitHub
☆11Jun 15, 2022Updated 4 years ago
KhanhNguyen4999 / Speech-Enhancement-CLSKD
View on GitHub
Cross-Layer Similarity Knowledge Distillation for Speech Enhancement
☆11Jun 22, 2023Updated 3 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
ashi-ta / speechGLUE
View on GitHub
SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.
☆13Jun 2, 2023Updated 3 years ago
ftshijt / speech_evaluation
View on GitHub
A toolkit dedicate for speech evaluation.
☆23Sep 26, 2024Updated last year
voidful / asrp
View on GitHub
ASR text preprocessing utility
☆21Aug 5, 2024Updated last year
sungnyun / ARMHuBERT
View on GitHub
(Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT
☆41Aug 29, 2024Updated last year
dynamic-superb / dynamic-superb
View on GitHub
The official repository of Dynamic-SUPERB.
☆200Jun 24, 2025Updated last year
Kyubyong / g2p
View on GitHub
g2p: English Grapheme To Phoneme Conversion
☆926Jan 5, 2023Updated 3 years ago
aixplain / NoRefER
View on GitHub
☆18Jun 5, 2026Updated last month
emuell / AFEC
View on GitHub
Cross platform audio feature extraction and sound classification tool
☆29Mar 16, 2026Updated 4 months ago
mechanicalsea / lighthubert
View on GitHub
LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT
☆73Sep 26, 2022Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
talhanai / wer-sigtest
View on GitHub
Script to perform statistical significance test between ASR hypotheses.
☆23Aug 13, 2017Updated 8 years ago
skit-ai / slu-prosody
View on GitHub
Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…
☆27May 17, 2023Updated 3 years ago
thevasudevgupta / speech-jax
View on GitHub
Speech in Flax/JAX
☆14Jul 11, 2022Updated 4 years ago
kehanlu / DeSTA2
View on GitHub
Code and model for ICASSP 2025 Paper "Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data"
☆127Jul 15, 2025Updated last year
yanghaha0908 / FastHuBERT
View on GitHub
Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning
☆100Nov 20, 2024Updated last year
desh2608 / kaldi-noise-vectors
View on GitHub
Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.
☆13Feb 13, 2021Updated 5 years ago
Crystalsound / FRN
View on GitHub
☆28Apr 17, 2023Updated 3 years ago
dihardchallenge / dihard3_baseline
View on GitHub
☆30Jul 21, 2022Updated 4 years ago
ga642381 / Spoken-Dialogue-Model-Survey
View on GitHub
A survey of spoken dialogue models (SDMs) with speech input and speech output. Focus on their Intermediate Representation and Generation …
☆31Mar 24, 2026Updated 4 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
vectominist / MiniASR
View on GitHub
A mini, simple, and fast end-to-end automatic speech recognition toolkit.
☆53Dec 6, 2022Updated 3 years ago
HuangZiliAndy / SSL_for_multitalker
View on GitHub
ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS
☆33Mar 16, 2023Updated 3 years ago
alvations / usaarhat-repo
View on GitHub
Hack and Tell @ Saarland University
☆19Dec 11, 2017Updated 8 years ago
kehanlu / Mandarin-Wav2Vec2
View on GitHub
Pre-trained Wav2vec2.0 for Mandarin
☆43Oct 30, 2022Updated 3 years ago
chitralekha18 / PESnQ_APSIPA2017
View on GitHub
☆25Oct 10, 2019Updated 6 years ago
nii-yamagishilab / SpeechSPC-mini
View on GitHub
Speech Security and Privacy Compendium - Mini
☆10Jun 18, 2024Updated 2 years ago
EmulationAI / awesome-large-audio-models
View on GitHub
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
☆734Jun 3, 2026Updated last month
AjeetGitHub2016 / deeplearning.ai
View on GitHub
deeplearning.ai is the complete course on Deep Learning on Coursera. The instructor of this course is Andrew Ng. Programming assignments…
☆12Jul 6, 2018Updated 8 years ago
zy-du / Disentanglement-of-Emotional-Style-and-Speaker-Identity-for-Expressive-Voice-Conversion
View on GitHub
This is the implementation our Interspeech 2022 paper " Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…
☆21Sep 18, 2023Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
juice500ml / dysarthria-gop
View on GitHub
Official implementation of the paper "Speech Intelligibility Assessment of Dysarthric Speech by using Goodness of Pronunciation with Unce…
☆28Mar 13, 2025Updated last year
archiki / Robust-E2E-ASR
View on GitHub
This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…
☆49Dec 25, 2024Updated last year
sc0ttms / SE-DCCRN
View on GitHub
☆22Mar 2, 2022Updated 4 years ago
slSeanWU / beats-conformer-bart-audio-captioner
View on GitHub
PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…
☆41Jan 6, 2024Updated 2 years ago
usc-sail / child-adult-diarization
View on GitHub
public child-adult speaker diarization/classification model and codes
☆19Apr 24, 2025Updated last year
sanphiee / MOT-sGPLDA-SRE14
View on GitHub
Multiobjective Optimization Training of PLDA for Speaker Verification
☆10Jun 14, 2018Updated 8 years ago
daanzu / wav2vec2_stt_python
View on GitHub
Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni…
☆23Aug 16, 2021Updated 4 years ago