satvik-venkatesh / audio-seg-data-synthView external linksLinks
Artificially synthesising data for audio segmentation to improve music-speech detection
☆18Jul 7, 2021Updated 4 years ago
Alternatives and similar repositories for audio-seg-data-synth
Users that are interested in audio-seg-data-synth are comparing it to the libraries listed below
Sorting:
- Improving beat tracking algorithms with recurrent neural networks.☆11Jan 7, 2019Updated 7 years ago
- Python framework for Speech and Music Detection using Keras.☆109Mar 24, 2023Updated 2 years ago
- Hearing loss simulation VST plugin☆13Mar 14, 2025Updated 11 months ago
- ☆101Oct 13, 2022Updated 3 years ago
- A NEW VERSION OF MIXING SECRETS DATASET FOR MUSIC SOURCE SEPARATION☆21Mar 3, 2023Updated 2 years ago
- ☆20Nov 3, 2021Updated 4 years ago
- ☆16Jul 20, 2021Updated 4 years ago
- Speech/Music discrimination using SampleCNN☆18May 30, 2025Updated 8 months ago
- music denoising network☆16Sep 24, 2024Updated last year
- Matlab implementation of the: J.R. Zapata, M. Davies and E. Gómez, "Multi-feature beat tracker," IEEE/ACM Transactions on Audio, Speech a…☆20Dec 2, 2024Updated last year
- Code of the paper "Music Boundary Detection using Convolutional Neural Networks: A comparative analysis of combined input features" in Py…☆40Apr 2, 2021Updated 4 years ago
- Real-time audio analysis with Keras for Speech and Music Detection.☆21Nov 15, 2018Updated 7 years ago
- Time-domain Audio Separation Network (IN PYTORCH)☆23Jan 28, 2019Updated 7 years ago
- Unofficial implementation of SpecTNT in pytorch☆50Oct 14, 2022Updated 3 years ago
- Streaming source separation for music and speech files, using the Open-Unmix LSTM architecture.☆22Dec 8, 2022Updated 3 years ago
- ☆17Dec 17, 2025Updated last month
- A python library for real-time audio time-scale modification procedures☆89Oct 7, 2017Updated 8 years ago
- Multi-Task Speech classification of accent and gender of an english speaker on Mozilla's common voice dataset☆27May 30, 2025Updated 8 months ago
- Codes and MIDI demos of ISMIR 2022 paper: Domain Adversarial Training on Conditional Variational Auto-Encoder for Controllable Music Gene…☆21Mar 28, 2023Updated 2 years ago
- Bottom-up Broadcast Neural Network For Music Genre Classification☆22Feb 3, 2021Updated 5 years ago
- solutions for https://www.kaggle.com/c/tensorflow-speech-recognition-challenge☆31Jan 28, 2018Updated 8 years ago
- This repository implements the Wave-U-net architecture in TensorFlow 2☆26Mar 16, 2021Updated 4 years ago
- A perceptual weighting filter loss for DNN training in speech enhancement☆24Apr 30, 2022Updated 3 years ago
- This repo contains the source code of the first deep learning-base singing voice beat tracking system. It leverages WavLM and DistilHuBER…☆33Sep 4, 2022Updated 3 years ago
- This repository contains the implementation of an efficient joint beat, downbeat, tempo, and meter tracking system using a compact 1D pro…☆73Nov 28, 2023Updated 2 years ago
- Time-Scale Modification For MATLAB☆66Jul 24, 2025Updated 6 months ago
- ☆27Apr 12, 2018Updated 7 years ago
- Training and evaluation code for Re-MOVE models with embedding distillation☆31Jul 6, 2023Updated 2 years ago
- Repository for the workshop titled 'Modelling room acoustics for immersive audio applications'☆32Sep 11, 2024Updated last year
- [NeurIPS 2025] Separate Anything in Audio with Zero Training☆53Nov 3, 2025Updated 3 months ago
- ☆10Aug 22, 2017Updated 8 years ago
- experiments about AudioSet☆43Jul 22, 2023Updated 2 years ago
- misc programming languages☆11Jan 10, 2023Updated 3 years ago
- Synthesizer Self-Attention is a very recent alternative to causal self-attention that has potential benefits by removing this dot product…☆14Dec 29, 2024Updated last year
- Source code for the paper "Memory-Efficient Fine-Tuning via Low-Rank Activation Compression"☆13Aug 1, 2025Updated 6 months ago
- ☆11Apr 20, 2020Updated 5 years ago
- Please visit https://thuhcsi.github.io/SnakeGAN/☆37Apr 25, 2023Updated 2 years ago
- A Python example For Acoustic Howling Suppression☆84Jul 29, 2020Updated 5 years ago
- (Interspeech 2025, official code) Speech enhancement based on cascaded two flows☆16Sep 1, 2025Updated 5 months ago