MTG/Podcastmix

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/MTG/Podcastmix)

MTG / Podcastmix

PodcastMix A dataset for separating music and speech in podcasts.

☆44

Alternatives and similar repositories for Podcastmix

Users that are interested in Podcastmix are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

MTG / PodcastMix-inference
View on GitHub
☆32Jan 6, 2022Updated 4 years ago
drscotthawley / fad_pytorch
View on GitHub
Frechet Audio Distance evaluation in PyTorch
☆36Jun 9, 2023Updated 3 years ago
bagustris / ssl-ser
View on GitHub
Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"
☆10Mar 15, 2023Updated 3 years ago
merlresearch / hyper-unmix
View on GitHub
Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…
☆73Apr 27, 2023Updated 3 years ago
otnemrasordep / ismir2022-datasets
View on GitHub
list of MIR dataset papers presented at ISMIR 2022
☆61Dec 11, 2022Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
salu133445 / deepperformer
View on GitHub
Deep Performer: Score-to-audio music performance synthesis
☆47Jun 26, 2023Updated 3 years ago
audiolabs / PESQ
View on GitHub
PESQ (Perceptual Evaluation of Speech Quality) Wrapper for Python Users (narrow band and wide band) - including P.862 Corrigendum 2 (03/…
☆23May 27, 2025Updated last year
chrispla / music-rearranger
View on GitHub
Rearrange a music recording to match a new duration - Code for "Music Rearrangement Using Hierarchical Segmentation", ICASSP 2023
☆46Jun 28, 2026Updated 3 weeks ago
jeonchangbin49 / LimitAug
View on GitHub
☆23Aug 30, 2022Updated 3 years ago
jeonchangbin49 / musdb-XL
View on GitHub
☆16Sep 7, 2022Updated 3 years ago
PanagiotisP / svs-multiband
View on GitHub
Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was accepted on EUSIPCO2022
☆15Jun 18, 2022Updated 4 years ago
kunimi00 / ContrastiveSSLMusicAudio
View on GitHub
☆13Jun 2, 2022Updated 4 years ago
fcaspe / dx7pytorch
View on GitHub
A musical instrument audio dataset generated on-the-fly using FM synthesis.
☆41Jun 5, 2026Updated last month
EvelynZhou / FAST-RIR
View on GitHub
This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating r…
☆12Nov 30, 2021Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
voidful / asrp
View on GitHub
ASR text preprocessing utility
☆21Aug 5, 2024Updated last year
chomeyama / HN-UnifiedSourceFilterGAN
View on GitHub
☆88Nov 1, 2022Updated 3 years ago
fgnt / paderbox
View on GitHub
Paderbox: A collection of utilities for audio / speech processing
☆43Jul 21, 2025Updated last year
voidful / vall-e-encodec
View on GitHub
☆41May 15, 2023Updated 3 years ago
KinWaiCheuk / demucs_lightning
View on GitHub
Demucs Lightning: A PyTorch lightning version of Demucs with Hydra and Tensorboard features
☆85May 3, 2023Updated 3 years ago
revsic / torch-nansy
View on GitHub
Torch implementation of NANSY, Neural Analysis and Synthesis, arXiv:2110.14513
☆64Feb 13, 2023Updated 3 years ago
eloimoliner / unconditional-diff-STFT
View on GitHub
Unconditional music synthesis using a diffusion model in the STFT domain
☆12May 31, 2022Updated 4 years ago
Chung-I / youtube-asr-crawler
View on GitHub
☆10Sep 19, 2022Updated 3 years ago
amazon-science / contextual-attention-nlm
View on GitHub
Accompanying code for paper "Attention-Based Contextual Language Model Adaptation for Speech Recognition", submitted to ACL 2021.
☆14Jul 25, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
mdx-tutorial / mdx-tutorial.github.io
View on GitHub
Tutorial covering Open Source tools for Source Separation.
☆15Nov 12, 2021Updated 4 years ago
alumae / torch-xvectors-wav
View on GitHub
☆22Jun 30, 2021Updated 5 years ago
york135 / MIRMLPop
View on GitHub
The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …
☆35Apr 22, 2024Updated 2 years ago
saurjya / EnsembleSep
View on GitHub
This branch of Asteroid contains code for the vocal harmony and chamber ensemble separation related papers.
☆12Nov 7, 2024Updated last year
Speech-Lab-IITM / CCC-wav2vec-2.0
View on GitHub
Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres…
☆23Mar 18, 2024Updated 2 years ago
cyrusasfa / meso-dtfa
View on GitHub
Mesostructures: Beyond Spectrogram Loss in Differentiable Time-Frequency Analysis (Meso-DTFA)
☆21Jun 30, 2026Updated 3 weeks ago
lankaraniamir / music-library-db-web-interface
View on GitHub
PostgreSQL database and a corresponding Flask web interface to store music library information dynamically with multiple linked versions …
☆10Jun 2, 2023Updated 3 years ago
sevagh / OnAir-Music-Dataset
View on GitHub
a new stem dataset for Music Demixing research, from the OnAir royalty-free music project
☆37Mar 14, 2023Updated 3 years ago
scart97 / thunder-speech
View on GitHub
A Hackable speech recognition library.
☆25Oct 16, 2024Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
etzinis / unsup_speech_enh_adaptation
View on GitHub
Unsupervised domain adaptation for conversational speech enhancement using RemixIT
☆59Apr 25, 2023Updated 3 years ago
DDMAL / SALAMI
View on GitHub
SALAMI Project Code
☆22May 1, 2021Updated 5 years ago
audiodemo / voice-conversion
View on GitHub
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks
☆17Aug 18, 2023Updated 2 years ago
biboamy / TVSM-dataset
View on GitHub
☆95May 2, 2026Updated 2 months ago
fgnt / mms_msg
View on GitHub
Multipurpose Multi Speaker Mixture Signal Generator
☆46Feb 6, 2025Updated last year
alexanderlerch / conference-deadlines
View on GitHub
MIR conference deadline countdowns
☆19Jun 24, 2022Updated 4 years ago
yukara-ikemiya / floss-torch
View on GitHub
PyTorch implementation of "Source Separation by Flow Matching (FLOSS)" by Google DeepMind
☆96Nov 24, 2025Updated 8 months ago