QingyuLiu0521 / ICSD
ICSD Dataset
☆21Updated 7 months ago
Alternatives and similar repositories for ICSD:
Users who are interested in ICSD are comparing it to the repositories listed below
- ☆30Updated last year
- Streaming Audio Transformers for online audio tagging☆43Updated 9 months ago
- ☆20Updated 5 months ago
- ☆26Updated last year
- This repo provides the network code and the processed samples of the manuscript "Glance and Gaze: A Collaborative Learning Framework for …☆67Updated 3 years ago
- Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha…☆32Updated 8 months ago
- COG-MHEAR Audio-Visual Speech Enhancement Challenge☆35Updated last week
- ☆25Updated 2 years ago
- ☆69Updated 2 years ago
- ☆14Updated 2 years ago
- The implementation of TaylorBeamformer, which is in submission to Interspeech2022☆40Updated 2 years ago
- This is the implementation of the paper ''Taylor, Can You Hear Me Now? A Taylor-Unfolding Framework for Monaural Speech Enhancement'', wh…☆67Updated 2 years ago
- ☆45Updated 2 years ago
- A PyTorch implementation of the paper: SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆31Updated 3 years ago
- Audio demos for the paper "DBT-Net: Dual-branch federative magnitude and phase estimation with attention-in-attention tra…☆27Updated 2 years ago
- MultiSV: scripts for data preparation☆27Updated 2 months ago
- ☆65Updated last year
- Official implementation of "PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords" (INTERSPEECH 2023)☆46Updated 10 months ago
- ☆13Updated 5 months ago
- PyTorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR".☆45Updated 2 weeks ago
- The implementation of "Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Real-Time Full-Band Speech Enhancement"☆52Updated 2 years ago
- Clustering-based methods for overlapping diarization☆80Updated last year
- Repository for the paper "Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis"☆16Updated 2 years ago
- Dynamic Mixing For Speech Processing (mix-on-the-fly)☆18Updated 2 years ago
- PyTorch implementation of "A Deep Learning Loss Function based on Auditory Power Compression for Speech Enhancement"☆28Updated 3 years ago
- A Diffusion Probabilistic Model for Target Sound Extraction☆37Updated 6 months ago
- This repository is the official implementation of unimodal aggregation (UMA) for automatic speech recognition (ASR).☆26Updated 3 months ago
- TODO☆37Updated last year
- Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enh…☆38Updated last month
- Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge.☆47Updated 2 months ago