fschmid56 / PretrainedSEDLinks
☆96Updated 4 months ago
Alternatives and similar repositories for PretrainedSED
Users that are interested in PretrainedSED are comparing it to the libraries listed below
Sorting:
- ☆23Updated 11 months ago
- A library built for easier audio self-supervised training, downstream tasks evaluation☆130Updated last week
- Boosting Self-Supervised Embeddings for Speech Enhancement☆47Updated 3 years ago
- Learning differentiable temporal resolution on time-series data.☆36Updated 2 years ago
- Official data preparation scripts for the URGENT 2024 Challenge☆83Updated 4 months ago
- EVAR ~ Evaluation package for Audio Representations☆64Updated last week
- A Diffusion Probabilistic Model for Target Sound Extraction☆40Updated last year
- Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge.☆70Updated 4 months ago
- ☆55Updated 10 months ago
- ☆38Updated 4 months ago
- NOMAD: Non-Matching Audio Distance (ICASSP 2024)☆29Updated 3 months ago
- A benchmark for evaluating audio encoders on various audio tasks.☆27Updated last month
- Exploring Binary Classification Loss for Speaker Verification☆17Updated 2 years ago
- Pytorch implementation of subband decomposition☆92Updated 3 years ago
- Generation scripts for EARS-WHAM and EARS-Reverb☆37Updated 2 months ago
- ☆88Updated last year
- [ICLR 2025] Enhancing Self-Supervised Models with Audio Mixtures for Polyphonic Soundscapes☆48Updated 4 months ago
- Sound Event Detection (SED) paper collection☆15Updated last year
- Official repository for the paper "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs"☆20Updated 3 weeks ago
- Prediction of sound event bounding boxes (SEBBs)☆30Updated last year
- COG-MHEAR Audio-Visual Speech Enhancement Challenge☆43Updated 4 months ago
- ARCH: Audio Representations benCHmark☆50Updated last year
- BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION☆69Updated last year
- This package aims at simplifying the download of the AudioSet dataset.☆54Updated 2 months ago
- Query-conditioned target sound extraction model☆25Updated 6 months ago
- ☆58Updated last year
- PAM is a no-reference audio quality metric for audio generation tasks☆74Updated last year
- ☆17Updated last month
- Code for CVSSP submission to DCASE 2021 Task 6☆36Updated 2 years ago
- Official implementation of DNSMOS Pro (accepted at INTERSPEECH 2024).☆60Updated 3 months ago