DmitryRyumin/ICASSP-2023-24-Papers

ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!

☆525

Alternatives and similar repositories for ICASSP-2023-24-Papers

Users that are interested in ICASSP-2023-24-Papers are comparing it to the libraries listed below.

DmitryRyumin / INTERSPEECH-2023-24-Papers
INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. …
☆685 · Dec 25, 2024 · Updated last year
DmitryRyumin / NewEraAI-Papers
The repository provides links to collections of influential and interesting research papers from top AI conferences, with open-source cod…
☆118 · Oct 24, 2025 · Updated 9 months ago
DmitryRyumin / CVPR-2023-24-Papers
CVPR 2023-2024 Papers: Dive into advanced research presented at the leading computer vision conference. Keep up to date with the latest d…
☆452 · Jul 15, 2024 · Updated 2 years ago
ddlBoJack / Speech-Resources
Speech-related labs, companies, resources, internships, and more; recommendations and self-nominations are welcome.
☆609 · Nov 13, 2024 · Updated last year
urgent-challenge / urgent2025_challenge
Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge.
☆85 · May 21, 2025 · Updated last year
EmulationAI / awesome-large-audio-models
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
☆734 · Jun 3, 2026 · Updated last month
liusongxiang / Large-Audio-Models
Keep track of big models in audio domain, including speech, singing, music etc.
☆515 · Jul 3, 2026 · Updated 3 weeks ago
RoyChao19477 / SEMamba
This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)
☆273 · Dec 12, 2025 · Updated 7 months ago
DmitryRyumin / Awesome-Speech-Enhancement
Read articles, explore effectiveness metrics for speech enhancement methodologies. Seamlessly integrate code implementations for better u…
☆29 · Apr 19, 2024 · Updated 2 years ago
ga642381 / speech-trident
Awesome speech/audio LLMs, representation learning, and codec models
☆1,240 · Jul 10, 2026 · Updated 2 weeks ago
WangHelin1997 / SpeechTasks
This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent spee…
☆83 · Jun 7, 2024 · Updated 2 years ago
voidful / Codec-SUPERB
Audio Codec Speech processing Universal PERformance Benchmark
☆308 · Jul 4, 2026 · Updated 3 weeks ago
fakufaku / diffusion-separation
Single channel speech source separation by diffusion process (ICASSP 2023)
☆126 · Mar 15, 2024 · Updated 2 years ago
york135 / MIRMLPop
The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …
☆35 · Apr 22, 2024 · Updated 2 years ago
haoxiangsnr / spiking-fullsubnet
Official repository of Spiking-FullSubNet, the Intel N-DNS Challenge Algorithmic Track Winner.
☆142 · Jan 28, 2026 · Updated 6 months ago
ZhangXInFD / SpeechTokenizer
This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples a…
☆657 · Jun 9, 2024 · Updated 2 years ago
haiciyang / LaDiffCodec
ICASSP 2024 - Generative De-Quantization for Neural Speech Codec via Latent Diffusion.
☆56 · Nov 16, 2025 · Updated 8 months ago
microsoft / SIG-Challenge
☆93 · Jun 9, 2024 · Updated 2 years ago
sp-uhh / sgmse-bbed
Brownian Bridge with Exponential Diffusion Coefficient
☆43 · Nov 1, 2023 · Updated 2 years ago
s3prl / s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
☆2,556 · Mar 12, 2026 · Updated 4 months ago
kuan2jiu99 / audio-hallucination
Understanding and Tackling Hallucinations in Large Audio-Language Models | ICASSP 2025, Interspeech 2024
☆34 · Mar 14, 2025 · Updated last year
pyf98 / DPHuBERT
INTERSPEECH 2023: "DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models"
☆118 · Jan 26, 2024 · Updated 2 years ago
Srijith-rkr / Whispering-LLaMA
EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction
☆271 · May 19, 2024 · Updated 2 years ago
sarulab-speech / UTMOS22
UT-Sarulab MOS prediction system using SSL models
☆309 · Apr 11, 2024 · Updated 2 years ago
sp-uhh / storm
StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation
☆256 · Sep 13, 2024 · Updated last year
mct10 / RepCodec
Models and code for RepCodec: A Speech Representation Codec for Speech Tokenization
☆196 · Jul 12, 2024 · Updated 2 years ago
kuan2jiu99 / Awesome-Speech-Generation
Survey on speech generation work.
☆21 · Nov 26, 2023 · Updated 2 years ago
JusperLee / Speech-Separation-Paper-Tutorial
Must-read papers on speech separation based on neural networks
☆952 · Aug 11, 2025 · Updated 11 months ago
modelscope / FunCodec
FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music gener…
☆445 · Jan 25, 2024 · Updated 2 years ago
yamathcy / ISMIR-2023-Papers
ISMIR 2023 Papers: A complete collection of influential and exciting research papers from the ISMIR 2023 conference.
☆105 · Dec 2, 2023 · Updated 2 years ago
tuanio / nextformer
PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"
☆10 · Dec 15, 2022 · Updated 3 years ago
JusperLee / SPMamba
☆227 · Dec 5, 2024 · Updated last year
sp-uhh / sgmse
Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation
☆764 · May 12, 2026 · Updated 2 months ago
microsoft / UniSpeech
UniSpeech - Large Scale Self-Supervised Learning for Speech
☆486 · Apr 5, 2024 · Updated 2 years ago
jishengpeng / WavChat
A Survey of Spoken Dialogue Models (60 pages)
☆316 · Nov 28, 2024 · Updated last year
shivammehta25 / Matcha-TTS
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
☆1,337 · Jul 13, 2026 · Updated 2 weeks ago
WenzheLiu-Speech / awesome-speech-enhancement
Speech enhancement, speech separation, sound source localization
☆1,244 · Nov 14, 2023 · Updated 2 years ago
facebookresearch / AudioDec
An Open-source Streaming High-fidelity Neural Audio Codec
☆512 · Mar 4, 2025 · Updated last year
yangdongchao / AcademiCodec
AcademiCodec: An Open Source Audio Codec Model for Academic Research
☆674 · Dec 27, 2023 · Updated 2 years ago