NINAnor / rare_species_detections
Repository for fine-tuning BEATs and using BEATs as a feature extractor in a prototypical network. This repository has been used to complete the DCASE2023 challenge on few-shot bioacoustic events.
☆33 Updated this week
Alternatives and similar repositories for rare_species_detections:
Users interested in rare_species_detections are comparing it to the libraries listed below:
- Source code for Consistent ensemble distillation for audio tagging ☆21 Updated 6 months ago
- Official data preparation scripts for the URGENT 2024 Challenge ☆75 Updated last week
- This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024) ☆159 Updated last month
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models. ☆79 Updated 4 months ago
- Code for sound event detection transformer (SEDT) and self-supervised pre-training SEDT (SP-SEDT) ☆37 Updated 2 years ago
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w… ☆46 Updated 3 months ago
- ☆30 Updated last year
- ☆64 Updated last year
- A library built for easier audio self-supervised training, downstream tasks evaluation ☆111 Updated 4 months ago
- Streaming Audiotransformers for online Audio tagging ☆43 Updated 7 months ago
- NOTSOFAR-1 Challenge: Distant Diarization and ASR ☆47 Updated 2 months ago
- A simple package for Guided source separation (GSS) ☆112 Updated 7 months ago
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT ☆53 Updated last year
- BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION ☆65 Updated 4 months ago
- The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023 ☆110 Updated last year
- Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enh… ☆31 Updated 5 months ago
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction ☆58 Updated last month
- This repository contains the code of the CP JKU submission to DCASE23 Task 1 "Low-complexity Acoustic Scene Classification" ☆23 Updated last year
- A 6-million Audio-Caption Paired Dataset Built with an LLM- and ALM-based Automatic Pipeline ☆111 Updated last month
- COG-MHEAR Audio-Visual Speech Enhancement Challenge ☆34 Updated 9 months ago
- Perceptual Contrast Stretching on Target Feature for Speech Enhancement (Accepted by INTERSPEECH 2022) ☆57 Updated 8 months ago
- ☆44 Updated last year
- Official implementation for our paper "Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations" ☆33 Updated 7 months ago
- wsj0-{2, 3, 4, 5} mix generation scripts, in Python. ☆54 Updated 3 years ago
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023. ☆95 Updated last year
- ☆44 Updated 4 years ago
- Clustering-based methods for overlapping diarization ☆74 Updated last year
- Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha… ☆32 Updated 5 months ago
- A PyTorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding" ☆52 Updated 4 months ago
- ConMamba for Automatic Speech Recognition ☆53 Updated 5 months ago