SonyCSLParis / pesto-full
Full models and training code for PESTO
☆74 · Jun 12, 2024 · Updated last year
Alternatives and similar repositories for pesto-full
Users who are interested in pesto-full are comparing it to the libraries listed below.
- Self-supervised learning for real-time pitch estimation ☆275 · Oct 15, 2025 · Updated 3 months ago
- Viterbi decoding in PyTorch ☆40 · Sep 10, 2025 · Updated 5 months ago
- Code for the paper "Toward Fully Self-Supervised Multi-Pitch Estimation". ☆23 · Sep 27, 2025 · Updated 4 months ago
- ICASSP 2024 paper - A Fully Differentiable Model for Unsupervised Singing Voice Separation ☆14 · Mar 7, 2025 · Updated 11 months ago
- Unconditional music synthesis using a diffusion model in the STFT domain ☆12 · May 31, 2022 · Updated 3 years ago
- LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM ☆18 · May 17, 2024 · Updated last year
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation ☆36 · Jan 17, 2024 · Updated 2 years ago
- PyTorch version of Spotify's Basic Pitch ☆44 · Apr 19, 2024 · Updated last year
- Fast and differentiable time domain all-pole filter in PyTorch. ☆68 · Feb 5, 2026 · Updated last week
- ☆55 · Nov 5, 2024 · Updated last year
- PyTorch implementation of "Source Separation by Flow Matching (FLOSS)" by Google DeepMind ☆91 · Nov 24, 2025 · Updated 2 months ago
- Pitch Estimating Neural Networks (PENN) ☆269 · Apr 2, 2025 · Updated 10 months ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks ☆17 · Aug 18, 2023 · Updated 2 years ago
- GRAFX: An Open-Source Library for Audio Processing Graphs in PyTorch ☆135 · Feb 3, 2025 · Updated last year
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics … ☆32 · Apr 22, 2024 · Updated last year
- Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions" ☆38 · Oct 28, 2025 · Updated 3 months ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning ☆18 · Oct 20, 2024 · Updated last year
- a Neural Vocoder supporting Ring Attention, Conformer and NSF. ☆24 · Aug 1, 2025 · Updated 6 months ago
- [ICASSP 2025] "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching" ☆107 · Jan 17, 2025 · Updated last year
- ONNX deployment of the CREPE pitch tracker ☆26 · Oct 27, 2022 · Updated 3 years ago
- A DDSP-based neural voice synthesiser. ☆126 · Nov 14, 2024 · Updated last year
- text to speech ☆10 · Mar 19, 2024 · Updated last year
- ☆11 · Nov 7, 2024 · Updated last year
- ☆66 · Aug 16, 2023 · Updated 2 years ago
- ☆19 · Feb 2, 2023 · Updated 3 years ago
- Speech enhancement in noisy and reverberant environments using deep neural networks ☆22 · Oct 10, 2025 · Updated 4 months ago
- [EMNLP 2025 Findings] Official code for EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion ☆33 · Sep 9, 2025 · Updated 5 months ago
- ☆15 · Nov 11, 2024 · Updated last year
- High-Fidelity Neural Phonetic Posteriorgrams ☆122 · Feb 23, 2025 · Updated 11 months ago
- ☆14 · Aug 1, 2025 · Updated 6 months ago
- GPT-style network for phonemization with durations of text ☆68 · Mar 21, 2024 · Updated last year
- Source Separation training codebase for the Sound Demixing Challenge 2023. ☆45 · May 18, 2023 · Updated 2 years ago
- ☆86 · May 21, 2023 · Updated 2 years ago
- ☆40 · Jan 24, 2023 · Updated 3 years ago
- applying audio FX with text descriptors ☆32 · Nov 12, 2025 · Updated 3 months ago
- ☆87 · Jan 29, 2023 · Updated 3 years ago
- Sequence alignment methods with helpers for PyTorch. ☆24 · Nov 30, 2022 · Updated 3 years ago
- Differentiable audio signal processors in PyTorch ☆283 · Dec 4, 2023 · Updated 2 years ago
- ☆74 · Apr 4, 2024 · Updated last year