biboamy/AVASpeech_Music_Labels

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/biboamy/AVASpeech_Music_Labels)

biboamy / AVASpeech_Music_Labels

☆20

Alternatives and similar repositories for AVASpeech_Music_Labels

Users that are interested in AVASpeech_Music_Labels are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

JozefColdenhoff / OpenACE
View on GitHub
☆11Aug 1, 2025Updated 11 months ago
satvik-venkatesh / audio-seg-data-synth
View on GitHub
Artificially synthesising data for audio segmentation to improve music-speech detection
☆17Jul 7, 2021Updated 5 years ago
felixCheungcheung / mixing_secrets_v2
View on GitHub
A NEW VERSION OF MIXING SECRETS DATASET FOR MUSIC SOURCE SEPARATION
☆22Mar 3, 2023Updated 3 years ago
yuhanghe01 / RiTTA
View on GitHub
Event Relation in Text-to-Audio (TTA) Generation
☆21Feb 26, 2025Updated last year
Sma1033 / drum_generation_with_ssm
View on GitHub
This is the supplemental repository for ISMIR 2019 paper GENERATING STRUCTURED DRUM PATTERN USING VARIATIONAL AUTOENCODER AND SELF-SIMILA…
☆23Oct 28, 2019Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
OmarMedhat22 / Sound-Classification-Short-Time-Fourier-Transform-STFT
View on GitHub
☆15May 28, 2020Updated 6 years ago
kaistmm / TalkNCE
View on GitHub
Official implementation of TalkNCE (ICASSP 2024).
☆18Apr 30, 2025Updated last year
InsightSoftwareConsortium / ITKIOOMEZarrNGFF
View on GitHub
ITK IO for images stored in OME-Zarr format.
☆11Sep 4, 2025Updated 10 months ago
morgan76 / HE
View on GitHub
PyTorch implementation of the paper Learning Multi-Level Representations for Hierarchical Music Structure Analysis presented at ISMIR 202…
☆16Jan 2, 2023Updated 3 years ago
keunwoochoi / music4all_contrib
View on GitHub
☆32Dec 29, 2020Updated 5 years ago
plnguyen2908 / UniTalk-ASD-code
View on GitHub
[Interspeech 2026] Revisiting Active Speaker Detection: An In-the-Wild Benchmark for Generalization and Robustness
☆21Jun 25, 2026Updated 3 weeks ago
sunyilong0 / Electric-Bicycle-MIS
View on GitHub
本文根据需求分析，基于Smobiler平台与C#语言开发出了一款“校园电动车管理信息系统”的手机APP。系统使用Visual
☆10Mar 30, 2022Updated 4 years ago
MTG / da-tacos
View on GitHub
A Dataset for Cover Song Identification and Understanding
☆66Feb 23, 2023Updated 3 years ago
barisbozkurt / MASTmelody_dataset
View on GitHub
A dataset of pitch curves for music performance assessment
☆11Jun 5, 2023Updated 3 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
JorenSix / JGaborator
View on GitHub
Fast Gabor spectral transforms in Java. Using a JNI bridge with the gaborator C++ library.
☆14Jan 20, 2023Updated 3 years ago
markostam / coversongs-dual-convnet
View on GitHub
Deep learning model trained to automatically identify cover songs using siamese convnets and tied together with a fully-connected sofmax.
☆19Jul 9, 2018Updated 8 years ago
jerryuhoo / VISinger
View on GitHub
Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.
☆39Feb 24, 2023Updated 3 years ago
exeex / maps-dataset
View on GitHub
MAPS ( MIDI Aligned Piano Sounds ) dataset python api for machine learning
☆11Jun 26, 2018Updated 8 years ago
carlthome / pmqd
View on GitHub
Perceived Music Quality Dataset
☆12Jul 1, 2024Updated 2 years ago
soham97 / PAM
View on GitHub
PAM is a no-reference audio quality metric for audio generation tasks
☆77Jul 19, 2024Updated 2 years ago
yuguochencuc / DeepFilterNet2
View on GitHub
Noise supression using deep filtering
☆20May 31, 2022Updated 4 years ago
yoyolicoris / music-demixing-challenge-ismir-2021-entry
View on GitHub
The training code for the 4th place model at MDX 2021 leaderboard A.
☆36Sep 1, 2021Updated 4 years ago
moises-ai / moises-db
View on GitHub
Moises Source Separation Public Dataset
☆191Feb 5, 2025Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
bytedance / midi_melody_extraction
View on GitHub
☆23Sep 27, 2023Updated 2 years ago
breizhn / DNS-Challenge
View on GitHub
This repo contains the scripts, models and required files for the Interspeech 2020 Deep Noise Suppression (DNS) Challenge. We are open so…
☆15May 15, 2020Updated 6 years ago
merlresearch / cocktail-fork-separation
View on GitHub
Baseline multi-resolution cross network model trained using the Divide and Remaster Dataset
☆89Jan 25, 2024Updated 2 years ago
jvbalen / sample_100
View on GitHub
A dataset of Hip Hop samples for Music Information Retrieval research
☆11Jun 1, 2016Updated 10 years ago
deezer / cover_song_detection
View on GitHub
Tools to run experiments around large scale cover detection.
☆28Sep 30, 2022Updated 3 years ago
furkanyesiler / move
View on GitHub
PyTorch code for training and evaluating MOVE, musically-motivated version embeddings
☆50Jul 6, 2023Updated 3 years ago
interactiveaudiolab / MSG
View on GitHub
☆53Jun 27, 2023Updated 3 years ago
ogunlao / glowtts_stdp
View on GitHub
Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor
☆19Jun 5, 2023Updated 3 years ago
jakeoneijk / FlashSR_Inference
View on GitHub
☆78Jan 25, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
alexw16 / gridnet
View on GitHub
☆15Jul 9, 2025Updated last year
diegotg2000 / PitchFlower
View on GitHub
Official implementation of the paper PitchFlower: A flow-based neural audio codec with pitch controllability
☆36Nov 3, 2025Updated 8 months ago
qlemaire22 / speech-music-detection
View on GitHub
Python framework for Speech and Music Detection using Keras.
☆113Mar 24, 2023Updated 3 years ago
csteinmetz1 / bela-zlc
View on GitHub
Zero-latency convolution on Bela platform
☆28Aug 4, 2021Updated 4 years ago
SonyCSLParis / cae-invar
View on GitHub
Learning Complex Basis Functions for Invariant Signal Representations with the Complex Autoencoder
☆38Dec 16, 2024Updated last year
Rt1z / Light-Musician
View on GitHub
Light musician is a tool to convert song to its light version. With Light Player, vocals in a song can be convert to other instruments us…
☆11Aug 29, 2022Updated 3 years ago
darius522 / dnr-utils
View on GitHub
Utility tools for the "Divide and Remaster" dataset, introduced as part of the Cocktail Fork problem paper: https://arxiv.org/abs/2110.09…
☆74Feb 13, 2023Updated 3 years ago