QingyuLiu0521 / ICSDLinks
ICSD Dataset
☆30Updated last month
Alternatives and similar repositories for ICSD
Users that are interested in ICSD are comparing it to the libraries listed below
Sorting:
- Streaming Audiotransformers for online Audio tagging☆45Updated last year
- ☆30Updated 2 years ago
- (Hybrid) BYOL-S feature extractor using serab-byols package in pytorch.☆27Updated last year
- Implementation for paper: Multi-Metric Optimization using Generative Adversarial Networks for Near-End Speech Intelligibility Enhancement☆22Updated 3 years ago
- Csenet: Complex Squeeze-and-Excitation Network for Speech Depression Level Prediction (ICASSP 2022)☆13Updated 3 years ago
- ☆21Updated 9 months ago
- This is the code and dataset repo for Interspeech 2024 paper "Target conversation extraction: Source separation using turn-taking dynamic…☆48Updated 9 months ago
- Source code for ICASSP2022 "Pseudo Strong labels for large scale weakly supervised audio tagging"☆30Updated 3 years ago
- Implementation of "A Deep Learning Loss Function based on Auditory Power Compression for Speech Enhancement" by pytorch☆28Updated 3 years ago
- ☆14Updated 3 years ago
- Official Implementation of "Inference and Denoise: Causal Inference-based Neural Speech Enhancement"☆27Updated 2 years ago
- TODO☆41Updated last year
- ☆26Updated 2 years ago
- Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha…☆33Updated 11 months ago
- Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer☆88Updated 3 years ago
- ☆29Updated 3 years ago
- Official PyTorch inference code for the Interspeech 2025 paper: Efficient Speech Enhancement via Embeddings from Pre-trained Generative A…☆46Updated last month
- A Compact and Effective Pretrained Model for Speech Emotion Recognition☆43Updated last year
- ☆53Updated 2 years ago
- Towards Intelligibility-Oriented Audio-Visual Speech Enhancement☆14Updated 10 months ago
- Dynamic Mixing For Speech Processing (mix-on-the-fly)☆20Updated 3 years ago
- ☆65Updated 2 years ago
- This repository contains the baseline system for CHiME-8 MMCSG challenge focusing on transcribing both sides of a conversation where one …☆35Updated last year
- A Diffusion Probabilistic Model for Target Sound Extraction☆40Updated 9 months ago
- Official Pytorch implementation of PULSE: Positive–Unlabelled Learning for audio Signal Enhancement (Best Paper Award at ICASSP 2023)☆43Updated last year
- VoViT: Low Latency Graph-based Audio-Visual VoiceSeparation Transformer☆34Updated 2 years ago
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…☆19Updated last year
- Official implementation of Efficient Speech Separation Framework Based on Neural State-Space Models☆22Updated last year
- The Pytorch implementation of paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training☆42Updated 7 months ago
- Official PyTorch implementation of "Attention-Free Keyword Spotting", Mashrur. M. Morshed & Ahmad Omar Ahsan, PML4DC @ ICLR 2022.☆15Updated 2 years ago