intflow / YOLOX_AUDIOLinks
Audio event detection model based on YOLOX
☆86Updated 2 years ago
Alternatives and similar repositories for YOLOX_AUDIO
Users that are interested in YOLOX_AUDIO are comparing it to the libraries listed below
Sorting:
- Phase-aware speech enchancement with Deep Complex U-Net☆118Updated 2 years ago
- AD-YOLO: You Look Only Once in Training Multiple Sound Event Localization and Detection, ICASSP 2023☆30Updated 4 months ago
- DCCRN with various loss functions☆96Updated 2 years ago
- Pytorch version of Voice Activity Detection (VAD) based on Deep Learning (https://github.com/filippogiruzzi)☆26Updated 4 years ago
- ☆65Updated 10 months ago
- ☆66Updated 2 years ago
- Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN☆92Updated 3 years ago
- Official code for Metric learning for user-defined keyword spotting☆33Updated last year
- The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".☆265Updated last year
- The audio demos with respect to the paper "DBT-Net: Dual-branch federative magnitude and phase estimation with attention-in-attention tra…☆28Updated 2 years ago
- This is the public repository for eigenvector-based SALSA features for polyphonic sound event localization and detection.☆102Updated 3 years ago
- [Tiny VAD] SG-VAD: Stochastic Gates Based Speech Activity Detection☆32Updated 3 months ago
- A training code template for DNN-based speech enhancement.☆108Updated 3 months ago
- This is the official implement of Mamba-SEUNet: Mamba UNet for Monaural Speech Enhancement☆67Updated last month
- Multi-Task Audio Source Separation, Two-Stage Model, Complex Domain.☆93Updated 2 years ago
- This repo provides the network code and the processed samples of the manuscript "Glance and Gaze: A Collaborative Learning Framework for …☆68Updated 3 years ago
- Unofficial implementation of Dual-Path Transformer Network (DPTNet) for speech separation (Interspeech 2020)☆43Updated 4 years ago
- MANNER: Multi-view Attention Network for Noise ERasure (Speech enhancement in time-domain)☆62Updated 2 years ago
- STOI loss function in PyTorch☆92Updated 9 months ago
- This in an implementation of NSNet in PyTorch and PyTorch Lightning. NSNet is a recurrent neural network for single channel speech enhanc…☆39Updated 4 years ago
- Multi-Scale Temporal Frequency Convolutional Network With Axial Attention for Speech Enhancement☆213Updated 2 years ago
- an Audio-Visual Voice Activity Detection using Deep Learning☆49Updated 6 years ago
- ☆109Updated 4 years ago
- ☆69Updated 2 years ago
- An example of a speech enhancement model deployed with TensorRT.☆63Updated 3 months ago
- Pytorch implementation of RawNeXt: Speaker verification system for variable-duration utterance with deep layer aggregation and dynamic sc…☆25Updated 3 years ago
- ☆91Updated 2 years ago
- ☆30Updated 2 years ago
- Official code of "DeFT-AN: Dense Frequency-Time Attentive Network for Multichannel Speech Enhancement, IEEE Signal Processing Letters, 20…☆29Updated 9 months ago
- Pytorch implementation of Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Pro…☆23Updated last year