intflow / YOLOX_AUDIO
Audio event detection model based on YOLOX
☆84Updated last year
Related projects: ⓘ
- Problem Generator for Math Word Prediction☆17Updated 2 years ago
- 2020 AI Grand Challenge (3rd track) - public sample☆17Updated 3 years ago
- Public dataset developed by KICT_INTFLOW for IITP AI GrandChallenge 2019, Track-3☆14Updated 4 years ago
- ☆10Updated 3 years ago
- 3rd Grand Challenge track 3 DB developed by GIST☆36Updated 3 years ago
- Grand Challenge 4 track 2 sourcecode developed by GIST☆13Updated 3 years ago
- perturbation_autovc☆18Updated 10 months ago
- AD-YOLO: You Look Only Once in Training Multiple Sound Event Localization and Detection, ICASSP 2023☆23Updated 3 weeks ago
- Pytorch version of Voice Activity Detection (VAD) based on Deep Learning (https://github.com/filippogiruzzi)☆26Updated 3 years ago
- zero_shot_gradtts☆14Updated 10 months ago
- ☆61Updated last week
- Pytorch implementation of RawNeXt: Speaker verification system for variable-duration utterance with deep layer aggregation and dynamic sc…☆23Updated 2 years ago
- Deep learning based autism spectral disorder detection from children voice☆30Updated last year
- Recipe for LibriPhrase☆23Updated last year
- An official implementation of the ICASSP 2023 paper: SG-VAD: Stochastic Gates Based Speech Activity Detection☆20Updated 2 months ago
- Conformer encoder + Transformer decoder with Hybrid CTC/attention☆12Updated 2 years ago
- Audio Only Speech Enhancement using Unet☆8Updated 3 years ago
- Augmentation adversarial training for self-supervised speaker recognition☆77Updated 3 years ago
- Unofficial implementation of Dual-Path Transformer Network (DPTNet) for speech separation (Interspeech 2020)☆41Updated 3 years ago
- Unofficial Multi-microphone complex spectral mapping for utterance-wise and continuous speech separation(MISO-BF-MISO)☆51Updated 2 years ago
- ☆44Updated last year
- Official implementation of "PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords" (INTERSPEECH 2023)☆35Updated 3 months ago
- ☆76Updated last year
- MetricGAN+ PyTorch Implementation☆18Updated 8 months ago
- Baseline for the Spoofing-aware Speaker Verification Challenge 2022☆62Updated 2 years ago
- Baseline method for sound event localization task of DCASE 2020 challenge☆53Updated 3 years ago
- ☆29Updated 2 years ago
- Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN☆86Updated 3 years ago
- ☆70Updated 3 weeks ago
- code for sound event detection transformer (SEDT) and self-supervised pre-training SEDT (SP-SEDT)☆32Updated 2 years ago