jim-schwoebel / sound_event_detection
🎵 A repository for manually annotating files to create labeled acoustic datasets for machine learning.
☆41Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for sound_event_detection
- Easy to use Audio Tagging in PyTorch☆20Updated 3 years ago
- Sound event detection with depthwise separable and dilated convolutions.☆53Updated 4 years ago
- Multi-Task Audio Source Separation, Two-Stage Model, Complex Domain.☆89Updated last year
- Learning differentiable temporal resolution on time-series data.☆33Updated 2 years ago
- Constrained Permutation Invariant Training, Speech Separation☆43Updated 3 years ago
- Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances☆48Updated 2 years ago
- ☆13Updated last year
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT☆52Updated last year
- ☆55Updated 3 years ago
- PyTorch Implementation of Generalized End-to-End Loss for Speaker Verification☆28Updated 3 years ago
- HiFi++: a Unified Framework for Bandwidth Extension and Speech Enhancement (ICASSP 2023)☆61Updated 11 months ago
- This is the official implementation of " Enhancing Embeddings for Speech Classification in Noisy Conditions"☆10Updated last year
- Streaming Audiotransformers for online Audio tagging☆41Updated 5 months ago
- The implementation of "Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement"☆115Updated 2 years ago
- Conformer-based Metric GAN for speech enhancement☆26Updated 6 months ago
- Towards Intelligibility-Oriented Audio-Visual Speech Enhancement☆13Updated 2 months ago
- Official repo for "A MODULATION-DOMAIN LOSS FOR NEURAL-NETWORK-BASED REAL-TIME SPEECH ENHANCEMENT" to appear in ICASSP 2021☆38Updated 3 years ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆89Updated 3 years ago
- GE2E Speaker Encoder - Generalized End-To-End Loss for Speaker Verification☆12Updated 4 years ago
- SAMO: SPEAKER ATTRACTOR MULTI-CENTER ONE-CLASS LEARNING FOR VOICE ANTI-SPOOFING☆35Updated last year
- SMS-WSJ: Spatialized Multi-Speaker Wall Street Journal database for multi-channel source separation and recognition☆111Updated 5 months ago
- wsj0-{2, 3, 4, 5} mix generation scripts, in Python.☆52Updated 3 years ago
- Paderborn Sound Event Detection☆70Updated last year
- Language modelling for sound event detection☆21Updated 4 years ago
- A fast implementation of bss_eval metrics for blind source separation☆131Updated 2 years ago
- Implementation of Phase-aware speech enhancement with deep complex U-Net☆38Updated last year
- This code is to run the WARP-Q speech quality metric.☆34Updated last month
- The implementation of "A Recursive Network with Dynamic Attention for Monaural Speech Enhancement"☆77Updated last year
- This repository contains the audio samples and the source code that accompany the paper: "MixCycle: Unsupervised Speech Separation via Cy…☆23Updated last year
- PodcastMix A dataset for separating music and speech in podcasts.☆43Updated 3 months ago