yangdongchao / Target-sound-event-detectionView external linksLinks
The source code for target sound detection
☆15Feb 26, 2022Updated 3 years ago
Alternatives and similar repositories for Target-sound-event-detection
Users that are interested in Target-sound-event-detection are comparing it to the libraries listed below
Sorting:
- Pytorch implementation of the paper : A Global-local Attention Framework for Weakly Labelled Audio Tagging.☆13Feb 6, 2021Updated 5 years ago
- ☆13Apr 18, 2019Updated 6 years ago
- ☆14May 9, 2022Updated 3 years ago
- The source code of Tim-TSENet☆15Apr 22, 2022Updated 3 years ago
- Python scripts to create noisy and reverberant 2-speaker mixture audio with Libri-Light and WHAM☆17Nov 7, 2024Updated last year
- Pytorch implementation of the paper : Modeling Label Dependencies for Audio Tagging with Graph Convolutional Network☆13Sep 18, 2020Updated 5 years ago
- E2E TTS using Conditional Flow Matching (Experimental*)☆71Nov 10, 2023Updated 2 years ago
- A single-layer, streaming codec model providing SOTA audio quality and discrete tokens designed for superior downstream modelability.☆113Jun 4, 2025Updated 8 months ago
- ☆17Feb 14, 2020Updated 6 years ago
- A package used to test webrtc apm functions, such as aec, ns☆17Feb 21, 2019Updated 6 years ago
- A Diffusion Probabilistic Model for Target Sound Extraction☆40Sep 27, 2024Updated last year
- The code for DCASE2021 task5 submission.☆20Feb 21, 2022Updated 3 years ago
- Dynamic Mixing For Speech Processing (mix-on-the-fly)☆21Jul 19, 2022Updated 3 years ago
- Download and create a tfreader for the audioset dataset☆16Apr 16, 2020Updated 5 years ago
- Inspired by the convolutional recurrent neural network(CRNN) and inception, we propose a multiscale time-frequency convolutional recurren…☆23Apr 15, 2020Updated 5 years ago
- Generalized RNN beamformer for speech separation☆18Jan 11, 2022Updated 4 years ago
- Code for the paper "Unsupervised Contrastive Learning of Sound Event Representations", ICASSP 2021.☆93Dec 22, 2022Updated 3 years ago
- A pytorch implementation of the paper : Acoustic Scene Classification with Multiple Decision Schemes.☆20Dec 12, 2020Updated 5 years ago
- ☆20May 13, 2019Updated 6 years ago
- ☆131Jul 21, 2021Updated 4 years ago
- The implementation of "Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Real-Time Full-Band Speech Enhancement"☆53Feb 16, 2023Updated 3 years ago
- A python implementation of “SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization” [ICASSP 2022]☆59Sep 28, 2024Updated last year
- context-aware Unet based on transformer for speech denoising☆24Feb 6, 2021Updated 5 years ago
- PyTorch implementation of WASE described in our ICASSP 2021: "Wase: Learning When to Attend for Speaker Extraction in Cocktail Party Envi…☆26Jan 11, 2022Updated 4 years ago
- Reimplementation of Miipher☆29Aug 16, 2023Updated 2 years ago
- MagicData-RAMC Dataset and Baseline☆57Sep 13, 2022Updated 3 years ago
- Jupyter notebook for DCASE 2020 challenge Task 1☆20Jun 24, 2020Updated 5 years ago
- ☆54Jun 3, 2020Updated 5 years ago
- Sound event detection with depthwise separable and dilated convolutions.☆52Mar 30, 2020Updated 5 years ago
- Evaluation toolbox for Sound Event Detection☆157Jun 12, 2024Updated last year
- Acoustic Scene Classification Using Deep Residual Networks with Late Fusion of Separated High and Low Frequency Paths - McDonnell and Gao…☆22Jul 3, 2024Updated last year
- Automatic speech annotator processing speech with voice activaty detection, overlapping speech detection, speaker diarization and automat…☆33Jun 14, 2024Updated last year
- Implementation of "A Deep Learning Loss Function based on Auditory Power Compression for Speech Enhancement" by pytorch☆28Jan 31, 2022Updated 4 years ago
- Official implementation of the paper titled "Age and Gender Recognition Using a Convolutional Neural Network with a Specially Designed Mu…☆27Mar 5, 2024Updated last year
- Balanced Error Rate for Speaker Diarization☆33Feb 28, 2023Updated 2 years ago
- ☆24Sep 10, 2025Updated 5 months ago
- Code for Yun Wang's PhD Thesis: Polyphonic Sound Event Detection with Weak Labeling☆169May 14, 2022Updated 3 years ago
- Conditional Diffusion Probabilistic Model for Speech Enhancement☆250Dec 20, 2022Updated 3 years ago
- ☆60Jul 2, 2024Updated last year