MrSupW/ICMC-ASR_Baseline

The baseline system for the ICASSP2024 ICMC-ASR Challenge.

☆57

Alternatives and similar repositories for ICMC-ASR_Baseline

Users interested in ICMC-ASR_Baseline often compare it with the repositories listed below.

tzyll / ChineseHP
Dataset for Pinyin Regularization in Error Correction for Chinese Speech Recognition with Large Language Models in Interspeech 2024.
☆16 · Jul 4, 2024 · Updated 2 years ago
yufan-aslp / AliMeeting
The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to pro…
☆142 · Jun 10, 2022 · Updated 4 years ago
yluo42 / TAC
Transform-average-concatenate (TAC) method for end-to-end microphone permutation and number invariant ad-hoc beamforming.
☆310 · Jun 15, 2021 · Updated 5 years ago
desh2608 / gss
A simple package for Guided source separation (GSS)
☆134 · May 20, 2024 · Updated 2 years ago
zqlsnr / DPCRN
Real-time speech enhancement
☆18 · Jan 23, 2024 · Updated 2 years ago
mispchallenge / MISP-2023-Challenge-Baseline
☆25 · Jan 2, 2024 · Updated 2 years ago
REAL-TSE / REAL-TSE-Challenge
☆33 · Jun 1, 2026 · Updated last month
lixilinx / IVA4Cocktail
Neural network density models for speech separation.
☆20 · Nov 26, 2020 · Updated 5 years ago
Diamondfan / cassnat_asr
Implementation of CTC alignment-based single step non-autoregressive transformer
☆13 · Jun 2, 2023 · Updated 3 years ago
wenet-e2e / wesep
Target Speaker Extraction Toolkit
☆299 · Oct 4, 2025 · Updated 9 months ago
wenet-e2e / wesignal
Production first, nn-based on-device signal processing toolkit.
☆63 · May 30, 2023 · Updated 3 years ago
jsalt2020-asrdiar / jsalt2020_simulate
Training data simulation
☆60 · May 6, 2024 · Updated 2 years ago
DaiYvhang / AISHELL-5
In-car multi-channel speech transcription system of AISHELL-5.
☆48 · Jun 9, 2025 · Updated last year
aispeech-lab / TinyWASE
PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…
☆11 · Jun 28, 2021 · Updated 5 years ago
huaidanquede / Dense-TSNet
Official code for Dense-TSNet
☆12 · Sep 17, 2024 · Updated last year
NiniAndy / Paraformer-V2
From the paper "Paraformer-v2: An improved non-autoregressive transformer for noise-robust speech recognition"
☆29 · Nov 20, 2024 · Updated last year
bliunlpr / Robust_e2e_gan
PyTorch implementation of "Jointly Adversarial Enhancement Training for Robust End-to-End Speech Recognition"
☆19 · Jul 19, 2019 · Updated 7 years ago
felixfuyihui / AISHELL-4
☆140 · Jul 21, 2021 · Updated 5 years ago
kooBH / DSS
[WIP] Direction-based Multi-Channel Speech Separation
☆14 · Jan 25, 2024 · Updated 2 years ago
liyunlongaaa / AD-TUNING
AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in th…
☆11 · Feb 23, 2024 · Updated 2 years ago
Audio-WestlakeU / RealMAN
A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurI…
☆175 · Apr 29, 2025 · Updated last year
TomJwYu / WenetSpeechSpeakerCluster
☆55 · Jul 17, 2023 · Updated 3 years ago
xuchenglin28 / speaker_extraction_SpEx
Multi-scale time domain speaker extraction
☆81 · Jun 7, 2021 · Updated 5 years ago
yoonsanghyu / FaSNet-TAC-PyTorch
Full implementation of "End-to-end microphone permutation and number invariant multi-channel speech separation" (Interspeech 2020)
☆76 · Sep 14, 2021 · Updated 4 years ago
k2-fsa / kaldifst
Python wrapper for OpenFST and its extensions from Kaldi. Also supports reading/writing ark/scp files
☆56 · Apr 9, 2026 · Updated 3 months ago
Enny1991 / beamformers
Easy to use Beamformers for multi-channel speech separation/enhancement
☆216 · Jan 26, 2021 · Updated 5 years ago
liyunlongaaa / NSD-MS2S
CHiME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence ar…
☆88 · Jun 17, 2025 · Updated last year
SSTC-Challenge / SSTC2024_baseline_system
☆12 · Jun 14, 2024 · Updated 2 years ago
ASLP-lab / LLaSE-G1
LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement
☆47 · Mar 10, 2025 · Updated last year
vkothapally / JAECBF
☆62 · Apr 11, 2022 · Updated 4 years ago
sh01k / imp_tsp
Measuring impulse response with time-stretched pulse (TSP) signal
☆14 · Jul 3, 2019 · Updated 7 years ago
ZhongshuHou / MHA-DPCRN
We design a spectral compression mapping (SCM) for full-band speech enhancement, and propose a two-stage stream named MHA-DPCRN
☆24 · Jul 4, 2022 · Updated 4 years ago
DavidDiazGuerra / gpuRIR
Python library for Room Impulse Response (RIR) simulation with GPU acceleration
☆607 · Jul 18, 2025 · Updated last year
kuan2jiu99 / audio-hallucination
Understanding and Tackling Hallucinations in Large Audio-Language Models | ICASSP 2025, Interspeech 2024
☆34 · Mar 14, 2025 · Updated last year
wonjune-kang / expressive-speech-retrieval
Expressive Speech Retrieval using Natural Language Descriptions of Speaking Style
☆15 · Aug 18, 2025 · Updated 11 months ago
funcwj / aps
A personal toolkit for single/multi-channel speech recognition & enhancement & separation.
☆146 · Jul 6, 2023 · Updated 3 years ago
fgnt / pb_chime5
Speech enhancement system for the CHiME-5 dinner party scenario
☆111 · Feb 6, 2025 · Updated last year
jymh / SAP2-ASR
☆26 · Jan 23, 2026 · Updated 6 months ago
popcornell / MicRank
MicRank is a Learning to Rank neural channel selection framework where a DNN is trained to rank microphone channels.
☆22 · Apr 8, 2021 · Updated 5 years ago