intflow / YOLOX_AUDIO
Audio event detection model based on YOLOX
☆86Updated 2 years ago
Alternatives and similar repositories for YOLOX_AUDIO:
Users that are interested in YOLOX_AUDIO are comparing it to the libraries listed below
- ☆10Updated 4 years ago
- 2020 AI Grand Challenge (3rd track) - public sample☆17Updated 4 years ago
- Public dataset developed by KICT_INTFLOW for IITP AI GrandChallenge 2019, Track-3☆14Updated 5 years ago
- 3rd Grand Challenge track 3 DB developed by GIST☆36Updated 4 years ago
- perturbation_autovc☆18Updated last year
- Pytorch version of Voice Activity Detection (VAD) based on Deep Learning (https://github.com/filippogiruzzi)☆26Updated 4 years ago
- Problem Generator for Math Word Prediction☆17Updated 3 years ago
- Grand Challenge 4 track 2 sourcecode developed by GIST☆13Updated 4 years ago
- Pytorch implementation of RawNeXt: Speaker verification system for variable-duration utterance with deep layer aggregation and dynamic sc…☆25Updated 2 years ago
- Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN☆90Updated 3 years ago
- zero_shot_gradtts☆14Updated last year
- Pytorch implementation of Extended U-Net for Speaker Verification in Noisy Environments☆28Updated last year
- ☆63Updated 7 months ago
- Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances☆49Updated 2 years ago
- AD-YOLO: You Look Only Once in Training Multiple Sound Event Localization and Detection, ICASSP 2023☆26Updated last month
- ☆60Updated last year
- Pytorch implementation of Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Pro…☆23Updated last year
- Baseline for the Spoofing-aware Speaker Verification Challenge 2022☆65Updated 2 years ago
- Phase-aware speech enchancement with Deep Complex U-Net☆110Updated 2 years ago
- Recipe for LibriPhrase☆28Updated last year
- [Tiny VAD] SG-VAD: Stochastic Gates Based Speech Activity Detection☆27Updated last month
- Pytorch implementation of Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs (Interspeech, 2020)☆73Updated 4 years ago
- TDY-CNN for text-independent speaker verification☆17Updated 2 years ago
- Official repository of NeXt-TDNN for speaker verification☆70Updated 6 months ago
- Audio Only Speech Enhancement using Unet☆9Updated 4 years ago
- ☆110Updated 4 years ago
- This repository contains the code of the CP JKU submission to DCASE23 Task 1 "Low-complexity Acoustic Scene Classification"☆28Updated last year
- MANNER: Multi-view Attention Network for Noise ERasure (Speech enhancement in time-domain)☆60Updated 2 years ago
- Framework for training and evaluating self-supervised learning methods for speaker verification.☆22Updated 2 months ago
- ☆69Updated 2 years ago