MTG / PodcastMix-inferenceLinks
☆32Updated 3 years ago
Alternatives and similar repositories for PodcastMix-inference
Users that are interested in PodcastMix-inference are comparing it to the libraries listed below
Sorting:
- PodcastMix A dataset for separating music and speech in podcasts.☆43Updated 9 months ago
- ☆18Updated 3 years ago
- Deep Performer: Score-to-audio music performance synthesis☆43Updated last year
- A fast python library for aligning similar audio snippets passed in as NumPy arrays☆47Updated last week
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆31Updated 2 years ago
- A C++/Cython audio limiter for Python.☆25Updated 2 years ago
- ☆26Updated 4 years ago
- Project for MIDI to Audio Synthesis☆23Updated 2 years ago
- ☆32Updated 4 years ago
- Frechet Audio Distance evaluation in PyTorch☆34Updated last year
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆37Updated last year
- Torch implementation of Whisper-guided DDPM based Voice Conversion☆49Updated 2 years ago
- Phoneme Level Lyrics Alignment and Text-Informed Singing Voice Separation☆23Updated 3 years ago
- Ultrafast GAN based Vocoder for Text to Speech☆50Updated 2 years ago
- Reproducible Subjective Evaluation☆60Updated last year
- ☆15Updated 4 years ago
- An implementation of "Towards Improving Harmonic Sensitivity and Prediction Stability for Singing Melody Extraction", in ISMIR 2023☆22Updated last year
- NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling☆37Updated 4 years ago
- Prosody and Pronunciation Modification Network☆54Updated 3 weeks ago
- PyTorch Dataset for Speech and Music audio☆76Updated 10 months ago
- ☆43Updated 11 months ago
- 60k hours of phoneme-aligned audio from audio books☆18Updated 10 months ago
- ☆83Updated 2 years ago
- Alignment examples for Interspeech 2024☆21Updated 10 months ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆70Updated 2 years ago
- logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even whe…☆35Updated 8 months ago
- A repo that builds text to music datasets from scratch☆21Updated 2 weeks ago
- Who calls the shots? Rethinking Few-Shot Learning for Audio (WASPAA 2021)☆43Updated 3 years ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆10Updated 3 weeks ago
- Training code and trained checkpoints for ASGAN.☆62Updated last year