satvik-venkatesh / audio-seg-data-synth
Artificially synthesising data for audio segmentation to improve music-speech detection
☆15Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for audio-seg-data-synth
- Unsupervised Representation Learning for Singing Voice Separation☆21Updated last year
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆57Updated last year
- acoss: Audio Cover Song Suite is a framework for feature extraction and benchmarking for the cover song identification (CSI) task☆37Updated last year
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"☆33Updated 2 years ago
- Semi-supervised learning using teacher-student models for vocal melody extraction☆41Updated 3 years ago
- experiments about AudioSet☆43Updated last year
- Audio activity detector based on per-channel energy normalization (PCEN)☆30Updated 5 years ago
- Pytorch: Channel-wise subband (CWS) input for better voice and accompaniment separation☆94Updated 2 years ago
- Pytorch implementation of paper "High Fidelity Speech Regeneration With Application to Speech Enhancement"☆15Updated 3 years ago
- Simple baseline model for the HEAR benchmark☆22Updated last week
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆27Updated 2 years ago
- ☆18Updated 3 years ago
- Official implementation of EfficientLEAF, a learnable audio frontend.☆39Updated last year
- ☆33Updated last year
- Da - ECHO - RetrievAl - daTasEt☆24Updated 4 months ago
- PodcastMix A dataset for separating music and speech in podcasts.☆43Updated 2 months ago
- Pitch estimation network (PiENet) for noise-robust neural F0 estimation of speech signals☆50Updated 5 years ago
- Frechet Audio Distance evaluation in PyTorch☆35Updated last year
- Pitch-shifting and time-stretching with TD-PSOLA☆76Updated last year
- Learning Complex Basis Functions for Invariant Signal Representations with the Complex Autoencoder☆34Updated 8 months ago
- ☆17Updated 3 years ago
- This repo contains code for comparing audio representation sin the task of audio synthesis wth Generative Adversarial Networks (GAN)☆37Updated last year
- Pytorch implementation of subband decomposition☆89Updated 2 years ago
- Distributed semi-constrained microphone arrays☆29Updated 6 months ago
- Backpropagable pytorch implementation of https://craffel.github.io/mir_eval/.☆35Updated 4 months ago
- TAPE: An End-to-End Timbre-Aware Pitch Estimator☆20Updated 11 months ago
- CP-JKU submission to DCASE 20☆43Updated 3 years ago
- Code for ISMIR 2020 paper: "Multiple F0 Estimation in Vocal Ensembles using Convolutional Neural Networks"☆54Updated last year
- PyTorch Dataset for Speech and Music audio☆73Updated 3 months ago