intflow / YOLOX_AUDIOLinks
Audio event detection model based on YOLOX
☆86Updated 2 years ago
Alternatives and similar repositories for YOLOX_AUDIO
Users that are interested in YOLOX_AUDIO are comparing it to the libraries listed below
Sorting:
- AD-YOLO: You Look Only Once in Training Multiple Sound Event Localization and Detection, ICASSP 2023☆31Updated 6 months ago
- ☆93Updated 2 years ago
- ☆66Updated last year
- Phase-aware speech enchancement with Deep Complex U-Net☆129Updated 2 years ago
- Official code for Metric learning for user-defined keyword spotting☆34Updated last year
- Pytorch version of Voice Activity Detection (VAD) based on Deep Learning (https://github.com/filippogiruzzi)☆27Updated 4 years ago
- VoViT: Low Latency Graph-based Audio-Visual VoiceSeparation Transformer☆35Updated 2 years ago
- [Tiny VAD] SG-VAD: Stochastic Gates Based Speech Activity Detection☆36Updated 6 months ago
- Problem Generator for Math Word Prediction☆17Updated 3 years ago
- code for sound event detection transformer (SEDT) and self-supervised pre-training SEDT (SP-SEDT)☆45Updated 3 years ago
- Source code for Consistent ensemble distillation for audio tagging☆46Updated 3 months ago
- ☆71Updated 2 years ago
- The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".☆269Updated 2 months ago
- Pytorch implementation of RawNeXt: Speaker verification system for variable-duration utterance with deep layer aggregation and dynamic sc…☆25Updated 3 years ago
- Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN☆96Updated 4 years ago
- Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances☆50Updated 2 years ago
- ICASSP 2022: 'Self-supervised Speaker Recognition with Loss-gated Learning'☆91Updated 2 years ago
- Streaming Audiotransformers for online Audio tagging☆47Updated last year
- Reading list for research topics in Sound AI☆189Updated last year
- ☆26Updated 2 years ago
- Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021☆157Updated 3 years ago
- DCCRN with various loss functions☆101Updated 2 years ago
- Multi-Task Audio Source Separation, Two-Stage Model, Complex Domain.☆94Updated 2 years ago
- This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".☆141Updated last month
- an Audio-Visual Voice Activity Detection using Deep Learning☆50Updated 6 years ago
- Domestic environment sound event detection task☆146Updated last year
- 2020 AI Grand Challenge (3rd track) - public sample☆17Updated 4 years ago
- A two-stage polyphonic sound event detection and localization method for both SED and DOA.☆115Updated 2 years ago
- A PyTorch implementation of " AN EMPIRICAL STUDY OF CONV-TASNET "☆49Updated 5 years ago
- Baseline method for sound event localization task of DCASE 2020 challenge☆55Updated 4 years ago