intflow / YOLOX_AUDIO
Audio event detection model based on YOLOX
☆85Updated last year
Related projects ⓘ
Alternatives and complementary repositories for YOLOX_AUDIO
- 2020 AI Grand Challenge (3rd track) - public sample☆17Updated 3 years ago
- ☆10Updated 3 years ago
- Public dataset developed by KICT_INTFLOW for IITP AI GrandChallenge 2019, Track-3☆14Updated 4 years ago
- 3rd Grand Challenge track 3 DB developed by GIST☆36Updated 3 years ago
- perturbation_autovc☆18Updated last year
- Pytorch version of Voice Activity Detection (VAD) based on Deep Learning (https://github.com/filippogiruzzi)☆26Updated 3 years ago
- Grand Challenge 4 track 2 sourcecode developed by GIST☆13Updated 3 years ago
- zero_shot_gradtts☆14Updated last year
- AD-YOLO: You Look Only Once in Training Multiple Sound Event Localization and Detection, ICASSP 2023☆26Updated 2 months ago
- Pytorch implementation of RawNeXt: Speaker verification system for variable-duration utterance with deep layer aggregation and dynamic sc…☆23Updated 2 years ago
- Problem Generator for Math Word Prediction☆17Updated 2 years ago
- ☆62Updated 2 months ago
- Official repository of NeXt-TDNN for speaker verification☆58Updated last month
- Audio Only Speech Enhancement using Unet☆9Updated 3 years ago
- An official implementation of the ICASSP 2023 paper: SG-VAD: Stochastic Gates Based Speech Activity Detection☆23Updated 4 months ago
- This repo provides the network code and the processed samples of the manuscript "Glance and Gaze: A Collaborative Learning Framework for …☆65Updated 2 years ago
- Unofficial Multi-microphone complex spectral mapping for utterance-wise and continuous speech separation(MISO-BF-MISO)☆50Updated 2 years ago
- ☆23Updated 2 years ago
- Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN☆89Updated 3 years ago
- implementation of "DCCRN-Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement" by pytorch☆49Updated 3 years ago
- Recipe for LibriPhrase☆23Updated last year
- Baseline for the Spoofing-aware Speaker Verification Challenge 2022☆64Updated 2 years ago
- MANNER: Multi-view Attention Network for Noise ERasure (Speech enhancement in time-domain)☆59Updated 2 years ago
- DCCRN with various loss functions☆91Updated 2 years ago
- ☆46Updated last year
- A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NIPS …☆95Updated last month
- Perceptual Contrast Stretching on Target Feature for Speech Enhancement (Accepted by INTERSPEECH 2022)☆54Updated 6 months ago
- The audio demos with respect to the paper "DBT-Net: Dual-branch federative magnitude and phase estimation with attention-in-attention tra…☆26Updated 2 years ago
- A training code template for DNN-based speech enhancement.☆53Updated 6 months ago
- A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"☆45Updated 2 months ago