fschmid56 / PretrainedSED
☆36 Updated this week
Alternatives and similar repositories for PretrainedSED:
Users who are interested in PretrainedSED are comparing it to the repositories listed below.
- ☆24 Updated 5 months ago
- Learning differentiable temporal resolution on time-series data. ☆36 Updated 2 years ago
- Sound Event Detection (SED) paper collection ☆13 Updated 9 months ago
- Boosting Self-Supervised Embeddings for Speech Enhancement ☆47 Updated 2 years ago
- TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings ☆27 Updated 6 months ago
- Prediction of sound event bounding boxes (SEBBs) ☆26 Updated 7 months ago
- Query-conditioned target sound extraction model ☆20 Updated 4 months ago
- Official PyTorch implementation of 'VINP: Variational Bayesian Inference with Neural Speech Prior for Joint ASR-Effective Speech Dereverb… ☆14 Updated last week
- ☆30 Updated 8 months ago
- ☆18 Updated 3 weeks ago
- Official data preparation scripts for the URGENT 2024 Challenge ☆76 Updated 2 months ago
- Generation scripts for EARS-WHAM and EARS-Reverb ☆30 Updated 6 months ago
- For students who would like to apply for RA, PhD, postdoc in audio research. ☆24 Updated 5 months ago
- Official implementation for our paper "Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations" ☆37 Updated 9 months ago
- Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge. ☆46 Updated 2 months ago
- ☆18 Updated 2 years ago
- CoNeTTE: An efficient Audio Captioning system leveraging multiple datasets with Task Embedding ☆15 Updated 3 months ago
- COG-MHEAR Audio-Visual Speech Enhancement Challenge ☆34 Updated this week
- A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult… ☆36 Updated 5 months ago
- Baseline code for DCASE 2023 task 4 B ☆13 Updated last year
- Exploring Binary Classification Loss for Speaker Verification ☆14 Updated last year
- code for sound event detection transformer (SEDT) and self-supervised pre-training SEDT (SP-SEDT) ☆41 Updated 2 years ago
- We propose C2SER, a novel audio-language model designed to enhance the stability and accuracy of speech emotion recognition through conte… ☆24 Updated 3 weeks ago
- A Diffusion Probabilistic Model for Target Sound Extraction ☆36 Updated 6 months ago
- ☆15 Updated 2 years ago
- Pytorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR". ☆43 Updated 2 weeks ago
- ☆62 Updated 6 months ago
- Code for CVSSP submission to DCASE 2021 Task 6 ☆35 Updated 2 years ago
- ☆34 Updated 2 years ago
- A repo containing download guidance and corresponding scripts of the VoxBlink dataset. ☆25 Updated 11 months ago