haoheliu / diffres-python
Learning differentiable temporal resolution on time-series data.
☆36Updated 2 years ago
Alternatives and similar repositories for diffres-python:
Users that are interested in diffres-python are comparing it to the libraries listed below
- A Diffusion Probabilistic Model for Target Sound Extraction☆36Updated 5 months ago
- Boosting Self-Supervised Embeddings for Speech Enhancement☆47Updated 2 years ago
- ☆33Updated 3 weeks ago
- ☆30Updated last year
- ☆18Updated 2 years ago
- ☆48Updated last year
- COG-MHEAR Audio-Visual Speech Enhancement Challenge☆34Updated 11 months ago
- ☆62Updated 6 months ago
- CoNeTTE: An efficient Audio Captioning system leveraging multiple datasets with Task Embedding☆15Updated 3 months ago
- Official implementation for our paper "Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations"☆35Updated 9 months ago
- ☆43Updated 2 years ago
- This repository contains the code of the CP JKU submission to DCASE23 Task 1 "Low-complexity Acoustic Scene Classification"☆26Updated last year
- Pytorch implementation of subband decomposition☆92Updated 2 years ago
- For students who would like to apply for RA, PhD, postdoc in audio research.☆24Updated 4 months ago
- Pytorch implementation of Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Pro…☆23Updated last year
- Inference code for PaSST, using the HEAR API.☆31Updated last year
- Code and data recipes for the paper: Heterogeneous Target Speech Separation☆41Updated 2 years ago
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆31Updated 3 years ago
- ☆15Updated 2 years ago
- ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'☆38Updated 2 years ago
- ☆57Updated 10 months ago
- Streaming Audiotransformers for online Audio tagging☆43Updated 9 months ago
- A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult…☆34Updated 5 months ago
- ☆52Updated 9 months ago
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT☆53Updated last year
- ☆23Updated 4 months ago
- Official repository of NeXt-TDNN for speaker verification☆67Updated 5 months ago
- Exploring Binary Classification Loss for Speaker Verification☆14Updated last year
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w…☆47Updated 4 months ago
- wsj0-{2, 3, 4, 5} mix generation scripts, in Python.☆55Updated 3 years ago