RicherMans / CED
Source code for Consistent ensemble distillation for audio tagging
☆26Updated 8 months ago
Alternatives and similar repositories for CED:
Users that are interested in CED are comparing it to the libraries listed below
- A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"☆56Updated 6 months ago
- Official data preparation scripts for the URGENT 2024 Challenge☆75Updated 2 months ago
- ☆33Updated last month
- ☆49Updated last year
- Source for the Interspeech 2024 Paper "Scaling up masked audio encoder learning for general audio classification"☆56Updated last month
- ☆25Updated last year
- ☆30Updated last year
- This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".☆121Updated 5 months ago
- A simple package for Guided source separation (GSS)☆117Updated 10 months ago
- A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurI…☆114Updated 3 months ago
- multi-scale time domain speaker extraction☆61Updated 3 years ago
- wsj0-{2, 3, 4, 5} mix generation scripts, in Python.☆56Updated 4 years ago
- ☆27Updated 2 years ago
- Voice activity detection (VAD) paper and code(From 198*~ )and its classification.☆94Updated last year
- MANNER: Multi-view Attention Network for Noise ERasure (Speech enhancement in time-domain)☆60Updated 2 years ago
- ☆57Updated 10 months ago
- ☆43Updated 2 years ago
- ☆31Updated 2 years ago
- ☆98Updated last year
- Streaming Audiotransformers for online Audio tagging☆43Updated 9 months ago
- The baseline system for the ICASSP2024 ICMC-ASR Challenge.☆47Updated last year
- Code for the Interspeech 2024 paper "MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting"☆26Updated 3 months ago
- ☆33Updated 3 years ago
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆95Updated last year
- Boosting Self-Supervised Embeddings for Speech Enhancement☆47Updated 2 years ago
- Training data simulation☆47Updated 10 months ago
- An unofficial implementation of the Personal VAD speaker-conditioned voice activity detection method. Bachelor's thesis project.☆65Updated 2 years ago
- COG-MHEAR Audio-Visual Speech Enhancement Challenge☆34Updated 11 months ago
- ☆32Updated 2 years ago
- A 6-million Audio-Caption Paired Dataset Built with a LLMs and ALMs-based Automatic Pipeline☆119Updated 3 months ago