intflow / YOLOX_AUDIO
Audio event detection model based on YOLOX
☆86 · Updated 2 years ago
Alternatives and similar repositories for YOLOX_AUDIO
Users who are interested in YOLOX_AUDIO are comparing it to the libraries listed below.
- 2020 AI Grand Challenge (3rd track) - public sample ☆17 · Updated 4 years ago
- ☆10 · Updated 4 years ago
- Public dataset developed by KICT_INTFLOW for IITP AI GrandChallenge 2019, Track-3 ☆14 · Updated 5 years ago
- 3rd Grand Challenge track 3 DB developed by GIST ☆36 · Updated 4 years ago
- perturbation_autovc ☆18 · Updated last year
- Pytorch version of Voice Activity Detection (VAD) based on Deep Learning (https://github.com/filippogiruzzi) ☆26 · Updated 4 years ago
- Pytorch implementation of RawNeXt: Speaker verification system for variable-duration utterance with deep layer aggregation and dynamic sc… ☆25 · Updated 2 years ago
- ☆63 · Updated 8 months ago
- Problem Generator for Math Word Prediction ☆17 · Updated 3 years ago
- Grand Challenge 4 track 2 sourcecode developed by GIST ☆13 · Updated 4 years ago
- zero_shot_gradtts ☆14 · Updated last year
- Official code for Metric learning for user-defined keyword spotting ☆31 · Updated last year
- AD-YOLO: You Look Only Once in Training Multiple Sound Event Localization and Detection, ICASSP 2023 ☆26 · Updated 2 months ago
- [Tiny VAD] SG-VAD: Stochastic Gates Based Speech Activity Detection ☆29 · Updated last month
- Pytorch implementation of Extended U-Net for Speaker Verification in Noisy Environments ☆28 · Updated last year
- Attention Backend for Automatic Speaker Verification with Multiple Enrollment Utterances ☆49 · Updated 2 years ago
- ☆19 · Updated 2 months ago
- Pytorch implementation of Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Pro… ☆23 · Updated last year
- Baseline for the Spoofing-aware Speaker Verification Challenge 2022 ☆65 · Updated 3 years ago
- Recipe for LibriPhrase ☆28 · Updated last year
- Official repository of NeXt-TDNN for speaker verification ☆71 · Updated 7 months ago
- This repo provides the network code and the processed samples of the manuscript "Glance and Gaze: A Collaborative Learning Framework for … ☆67 · Updated 3 years ago
- ☆88 · Updated last year
- Unofficial Multi-microphone complex spectral mapping for utterance-wise and continuous speech separation (MISO-BF-MISO) ☆50 · Updated 3 years ago
- ☆30 · Updated last year
- DCCRN with various loss functions ☆95 · Updated 2 years ago
- Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN ☆91 · Updated 3 years ago
- The audio demos with respect to the paper "DBT-Net: Dual-branch federative magnitude and phase estimation with attention-in-attention tra… ☆27 · Updated 2 years ago
- ☆83 · Updated 3 weeks ago
- MANNER: Multi-view Attention Network for Noise ERasure (Speech enhancement in time-domain) ☆61 · Updated 2 years ago