QingyuLiu0521 / ICSDLinks
ICSD Dataset
☆33Updated 3 months ago
Alternatives and similar repositories for ICSD
Users that are interested in ICSD are comparing it to the libraries listed below
Sorting:
- ☆30Updated 2 years ago
- Streaming Audiotransformers for online Audio tagging☆47Updated last year
- Implementation for paper: Multi-Metric Optimization using Generative Adversarial Networks for Near-End Speech Intelligibility Enhancement☆22Updated 4 years ago
- Csenet: Complex Squeeze-and-Excitation Network for Speech Depression Level Prediction (ICASSP 2022)☆13Updated 3 years ago
- Official Pytorch Implementation for Continual Learning For On-Device Environmental Sound Classification☆14Updated 3 years ago
- Official implement of "Dual-stream Time-Delay Neural Network with Dynamic Global Filter for Speaker Verification" in PyTorch☆41Updated 2 years ago
- ☆66Updated last year
- Learning differentiable temporal resolution on time-series data.☆36Updated 2 years ago
- ☆57Updated 2 years ago
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆69Updated 3 years ago
- A Diffusion Probabilistic Model for Target Sound Extraction☆40Updated 11 months ago
- VoViT: Low Latency Graph-based Audio-Visual VoiceSeparation Transformer☆35Updated 2 years ago
- ☆18Updated 3 years ago
- Code for paper "Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition"☆42Updated 2 years ago
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).☆152Updated 2 years ago
- ICASSP 2022: 'Self-supervised Speaker Recognition with Loss-gated Learning'☆91Updated 2 years ago
- A Compact and Effective Pretrained Model for Speech Emotion Recognition☆47Updated last year
- (Hybrid) BYOL-S feature extractor using serab-byols package in pytorch.☆27Updated last year
- Clustering-based methods for overlapping diarization☆80Updated last year
- Dynamic vision-guided speaker embedding for audio-visual speaker diarization☆11Updated 3 years ago
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆21Updated 2 years ago
- ☆92Updated 11 months ago
- ☆65Updated 2 years ago
- Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha…☆36Updated last year
- an Audio-Visual Voice Activity Detection using Deep Learning☆50Updated 6 years ago
- Toolkit for training and evaluating Self-Supervised Learning (SSL) frameworks for Speaker Verification (SV).☆35Updated 2 months ago
- Pytorch implementation of Extended U-Net for Speaker Verification in Noisy Environments☆28Updated 2 years ago
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…☆20Updated 2 years ago
- Discriminative Condition-Aware PLDA☆44Updated last year
- Baseline for the Spoofing-aware Speaker Verification Challenge 2022☆65Updated 3 years ago