The source code for target sound detection
☆15Feb 26, 2022Updated 4 years ago
Alternatives and similar repositories for Target-sound-event-detection
Users that are interested in Target-sound-event-detection are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Pytorch implementation of the paper : A Global-local Attention Framework for Weakly Labelled Audio Tagging.☆13Feb 6, 2021Updated 5 years ago
- Python scripts to create noisy and reverberant 2-speaker mixture audio with Libri-Light and WHAM☆17Nov 7, 2024Updated last year
- ☆14Apr 18, 2019Updated 7 years ago
- Pytorch implementation of the paper : Modeling Label Dependencies for Audio Tagging with Graph Convolutional Network☆14Sep 18, 2020Updated 5 years ago
- ☆15May 9, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The code for DCASE2021 task5 submission.☆20Feb 21, 2022Updated 4 years ago
- The source code of Tim-TSENet☆15Apr 22, 2022Updated 4 years ago
- E2E TTS using Conditional Flow Matching (Experimental*)☆71Nov 10, 2023Updated 2 years ago
- ☆17Feb 14, 2020Updated 6 years ago
- Download and create a tfreader for the audioset dataset☆17Apr 16, 2020Updated 6 years ago
- A package used to test webrtc apm functions, such as aec, ns☆17Feb 21, 2019Updated 7 years ago
- A single-layer, streaming codec model providing SOTA audio quality and discrete tokens designed for superior downstream modelability.☆114Jun 4, 2025Updated 11 months ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- Inspired by the convolutional recurrent neural network(CRNN) and inception, we propose a multiscale time-frequency convolutional recurren…☆23Apr 15, 2020Updated 6 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Code for the paper "Unsupervised Contrastive Learning of Sound Event Representations", ICASSP 2021.☆93Dec 22, 2022Updated 3 years ago
- A Diffusion Probabilistic Model for Target Sound Extraction☆40Sep 27, 2024Updated last year
- Evaluation toolbox for Sound Event Detection☆159Jun 12, 2024Updated last year
- Several studies have been carried out to analyse Parkinson’s disease using speech impairments. Various tools and techniques have been use…☆12Apr 1, 2019Updated 7 years ago
- Dynamic Mixing For Speech Processing (mix-on-the-fly)☆21Jul 19, 2022Updated 3 years ago
- ☆134Jul 21, 2021Updated 4 years ago
- Generalized RNN beamformer for speech separation☆18Jan 11, 2022Updated 4 years ago
- PyTorch implementation of WASE described in our ICASSP 2021: "Wase: Learning When to Attend for Speaker Extraction in Cocktail Party Envi…☆27Jan 11, 2022Updated 4 years ago
- Cochlear Modelling Jupyter Notebooks☆36Feb 2, 2024Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Automatic speech annotator processing speech with voice activaty detection, overlapping speech detection, speaker diarization and automat…☆33Jun 14, 2024Updated last year
- This repository contains the code for the paper "Exploiting Foundation Models and Speech Enhancement for Parkinson's Disease Detection fr…☆12Dec 19, 2025Updated 4 months ago
- ☆20May 13, 2019Updated 6 years ago
- Reimplementation of Miipher☆30Aug 16, 2023Updated 2 years ago
- Comparing Audio Features for Unsupervised Sound Classification☆10Jun 22, 2022Updated 3 years ago
- Parkinson’s Disease Classification from Speech Data using multiple Machine Learning approaches. This was implemented using scikit-learn P…☆14Feb 2, 2020Updated 6 years ago
- Tensorflow2 based implementation of ContextNet, an improved convolutional rnn-transducer-based architecture for end-to-end speech recogni…☆18Oct 19, 2020Updated 5 years ago
- ☆55Jun 3, 2020Updated 5 years ago
- The implementation of "Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Real-Time Full-Band Speech Enhancement"☆53Feb 16, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆27Sep 10, 2025Updated 7 months ago
- SLT 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge☆12Jun 11, 2024Updated last year
- Implementation of "A Deep Learning Loss Function based on Auditory Power Compression for Speech Enhancement" by pytorch☆28Jan 31, 2022Updated 4 years ago
- Code for Yun Wang's PhD Thesis: Polyphonic Sound Event Detection with Weak Labeling☆168May 14, 2022Updated 3 years ago
- Autonomous Driving W/ Deep Reinforcement Learning in Lane Keeping - DDQN and SAC with kinematics/birdview-images☆13Mar 24, 2026Updated last month
- Jupyter notebook for DCASE 2020 challenge Task 1☆20Jun 24, 2020Updated 5 years ago
- ☆60Jul 2, 2024Updated last year