intflow / YOLOX_AUDIOLinks
Audio event detection model based on YOLOX
☆86Updated 2 years ago
Alternatives and similar repositories for YOLOX_AUDIO
Users that are interested in YOLOX_AUDIO are comparing it to the libraries listed below
Sorting:
- AD-YOLO: You Look Only Once in Training Multiple Sound Event Localization and Detection, ICASSP 2023☆31Updated 7 months ago
- Pytorch version of Voice Activity Detection (VAD) based on Deep Learning (https://github.com/filippogiruzzi)☆27Updated 4 years ago
- Phase-aware speech enchancement with Deep Complex U-Net☆129Updated 2 years ago
- Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances☆50Updated 2 years ago
- Problem Generator for Math Word Prediction☆17Updated 3 years ago
- ☆41Updated 8 months ago
- ☆66Updated last year
- Pytorch implementation of RawNeXt: Speaker verification system for variable-duration utterance with deep layer aggregation and dynamic sc…☆25Updated 3 years ago
- This repo provides the network code and the processed samples of the manuscript "Glance and Gaze: A Collaborative Learning Framework for …☆69Updated 3 years ago
- VoViT: Low Latency Graph-based Audio-Visual VoiceSeparation Transformer☆35Updated 2 years ago
- ☆73Updated 2 years ago
- ☆30Updated 2 years ago
- This in an implementation of NSNet in PyTorch and PyTorch Lightning. NSNet is a recurrent neural network for single channel speech enhanc…☆40Updated 5 years ago
- Public dataset developed by KICT_INTFLOW for IITP AI GrandChallenge 2019, Track-3☆14Updated 5 years ago
- MANNER: Multi-view Attention Network for Noise ERasure (Speech enhancement in time-domain)☆63Updated 3 years ago
- Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN☆96Updated 4 years ago
- Baseline for the Spoofing-aware Speaker Verification Challenge 2022☆65Updated 3 years ago
- ☆46Updated 8 months ago
- 2020 AI Grand Challenge (3rd track) - public sample☆17Updated 4 years ago
- Official code for Metric learning for user-defined keyword spotting☆35Updated last year
- Official repository of NeXt-TDNN for speaker verification☆78Updated last year
- This repository includes the code to reproduce our paper Partially-Connected Differentiable Architecture Search for Deepfake and Spoofing…☆18Updated 3 years ago
- The audio demos with respect to the paper "DBT-Net: Dual-branch federative magnitude and phase estimation with attention-in-attention tra…☆28Updated 3 years ago
- Multi-Task Audio Source Separation, Two-Stage Model, Complex Domain.☆93Updated 2 years ago
- [Tiny VAD] SG-VAD: Stochastic Gates Based Speech Activity Detection☆36Updated 6 months ago
- Official implementation of "PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords" (INTERSPEECH 2023)☆56Updated last year
- A librosa STFT/Fbank/mfcc feature extration written up in PyTorch using 1D Convolutions.☆79Updated 3 years ago
- Official implement of "Dual-stream Time-Delay Neural Network with Dynamic Global Filter for Speaker Verification" in PyTorch☆41Updated 2 years ago
- DCCRN with various loss functions☆101Updated 3 years ago
- ☆25Updated last year