JishengBai / ICME2024ASC
baseline for IEEE ICME 2024 GC: Semi-supervised Acoustic Scene Classification under Domain Shift
☆14Updated 6 months ago
Related projects: ⓘ
- ☆15Updated 2 years ago
- code for sound event detection transformer (SEDT) and self-supervised pre-training SEDT (SP-SEDT)☆32Updated 2 years ago
- Full-frequency dynamic convolution: a physical frequency-dependent convolution for sound event detection☆13Updated 3 weeks ago
- ☆29Updated 2 years ago
- ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'☆30Updated last year
- ☆61Updated last week
- AudioLDM training, finetuning, evaluation and inference.☆11Updated 5 months ago
- COG-MHEAR Audio-Visual Speech Enhancement Challenge☆32Updated 5 months ago
- Query-conditioned target sound extraction model☆14Updated 3 months ago
- ☆41Updated last year
- The source code of Tim-TSENet☆11Updated 2 years ago
- Data simulation scripts for paper "Target Sound Extraction with Variable Cross-modality Clues"☆13Updated last year
- Learning differentiable temporal resolution on time-series data.☆33Updated last year
- Boosting Self-Supervised Embeddings for Speech Enhancement☆42Updated 2 years ago
- A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult…☆16Updated 2 weeks ago
- Code for paper "Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation"☆36Updated 2 months ago
- ☆19Updated last year
- ☆18Updated 2 years ago
- This repository contains the code of the CP JKU submission to DCASE23 Task 1 "Low-complexity Acoustic Scene Classification"☆21Updated last year
- This is the implementation of the paper ''Taylor, Can You Hear Me Now? A Taylor-Unfolding Framework for Monaural Speech Enhancement'', wh…☆61Updated 2 years ago
- Exploring Binary Classification Loss for Speaker Verification☆14Updated last year
- A Diffusion Probabilistic Model for Target Sound Extraction☆29Updated 4 months ago
- ☆26Updated last year
- This is official repository of new SOTA diffusion models based method for speech enhancement☆28Updated last month
- Official implementation of Efficient Speech Separation Framework Based on Neural State-Space Models☆17Updated last year
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…☆19Updated last year
- Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha…☆23Updated last month
- ☆24Updated last week
- Official Implementation of "Inference and Denoise: Causal Inference-based Neural Speech Enhancement"☆27Updated last year
- This repo provides the network code and the processed samples of the manuscript "Glance and Gaze: A Collaborative Learning Framework for …☆64Updated 2 years ago