jim-schwoebel / sound_event_detection
🎵 A repository for manually annotating files to create labeled acoustic datasets for machine learning.
☆41Updated 2 years ago
Alternatives and similar repositories for sound_event_detection:
Users that are interested in sound_event_detection are comparing it to the libraries listed below
- Sound event detection with depthwise separable and dilated convolutions.☆54Updated 4 years ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆89Updated 3 years ago
- ☆13Updated last year
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT☆53Updated last year
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).☆127Updated 2 years ago
- Python library for audio augmentation☆83Updated last year
- ☆29Updated 6 months ago
- Machine learning speaker characteristics☆33Updated this week
- Evaluation and Benchmarking of Speech Super-resolution Methods☆144Updated 2 years ago
- Learning differentiable temporal resolution on time-series data.☆35Updated 2 years ago
- This code is to run the WARP-Q speech quality metric.☆34Updated 3 months ago
- Streaming Audiotransformers for online Audio tagging☆43Updated 7 months ago
- (Hybrid) BYOL-S feature extractor using serab-byols package in pytorch.☆27Updated 8 months ago
- ☆63Updated 4 months ago
- Easy to use Audio Tagging in PyTorch☆20Updated 3 years ago
- ☆56Updated 3 years ago
- Phase-aware speech enchancement with Deep Complex U-Net☆101Updated last year
- A library built for easier audio self-supervised training, downstream tasks evaluation☆111Updated 4 months ago
- Masked Modeling Duo: Towards a Universal Audio Pre-training Framework☆81Updated 5 months ago
- A collection of common functionality to simplify the design, training and evaluation of machine learning models based on pytorch with an …☆71Updated 6 months ago
- Phoneme segmentation using pre-trained speech models☆54Updated 2 years ago
- Evaluate EfficientAT models on the Holistic Evaluation of Audio Representations Benchmark.☆25Updated last year
- A STFT/iSTFT written up in PyTorch using 1D Convolutions☆27Updated 6 months ago
- Confidence interval computation for evaluation in machine learning using the bootstrapping approach☆71Updated 9 months ago
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a…☆39Updated 3 years ago
- Towards Intelligibility-Oriented Audio-Visual Speech Enhancement☆14Updated 4 months ago
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated 2 years ago
- wav2vec2 audio classification for prosodic boundary detection and other tasks☆36Updated last year
- Official implementation of EfficientLEAF, a learnable audio frontend.☆39Updated 2 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆64Updated 3 years ago