QingyuLiu0521 / ICSDLinks
ICSD Dataset
☆38Updated 6 months ago
Alternatives and similar repositories for ICSD
Users that are interested in ICSD are comparing it to the libraries listed below
Sorting:
- ☆31Updated 2 years ago
- Streaming Audiotransformers for online Audio tagging☆49Updated last year
- Implementation for paper: Multi-Metric Optimization using Generative Adversarial Networks for Near-End Speech Intelligibility Enhancement☆22Updated 4 years ago
- ☆60Updated 2 years ago
- A Diffusion Probabilistic Model for Target Sound Extraction☆41Updated last year
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT☆55Updated 2 years ago
- Official implement of "Dual-stream Time-Delay Neural Network with Dynamic Global Filter for Speaker Verification" in PyTorch☆41Updated 2 years ago
- Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge.☆77Updated 6 months ago
- TODO☆44Updated 2 years ago
- ☆18Updated 3 years ago
- ☆66Updated 2 years ago
- Dynamic vision-guided speaker embedding for audio-visual speaker diarization☆12Updated 3 years ago
- A repo containing download guidance and corresponding scripts of the VoxBlink dataset.☆28Updated last year
- Vox-Profile Benchmark☆58Updated 3 months ago
- Official Pytorch Implementation for Continual Learning For On-Device Environmental Sound Classification☆14Updated 3 years ago
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…☆20Updated 2 years ago
- Learning differentiable temporal resolution on time-series data.☆36Updated 3 years ago
- An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification☆22Updated last year
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆33Updated 4 years ago
- code for sound event detection transformer (SEDT) and self-supervised pre-training SEDT (SP-SEDT)☆45Updated 3 years ago
- Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha…☆36Updated last year
- Dynamic Mixing For Speech Processing (mix-on-the-fly)☆21Updated 3 years ago
- VoViT: Low Latency Graph-based Audio-Visual VoiceSeparation Transformer☆35Updated 2 years ago
- Official Implementation of "Inference and Denoise: Causal Inference-based Neural Speech Enhancement"☆28Updated 2 years ago
- Pytorch implementation of Extended U-Net for Speaker Verification in Noisy Environments☆28Updated 2 years ago
- A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult…☆39Updated last year
- A Pytorch version of LPCNet, including dump weight☆36Updated 3 years ago
- ☆19Updated 2 years ago
- Toolkit for training and evaluating Self-Supervised Learning (SSL) frameworks for Speaker Verification (SV).☆34Updated this week
- Estimating the Age, Height, and Gender of a speaker with their speech signal.☆14Updated 3 years ago