Full models and training code for PESTO
☆76 · Jun 12, 2024 · Updated last year
Alternatives and similar repositories for pesto-full
Users that are interested in pesto-full are comparing it to the libraries listed below.
- Self-supervised learning for real-time pitch estimation ☆282 · Oct 15, 2025 · Updated 5 months ago
- Viterbi decoding in PyTorch ☆42 · Sep 10, 2025 · Updated 6 months ago
- ICASSP 2024 paper - A Fully Differentiable Model for Unsupervised Singing Voice Separation ☆14 · Mar 7, 2025 · Updated last year
- Code for the paper "Toward Fully Self-Supervised Multi-Pitch Estimation". ☆23 · Sep 27, 2025 · Updated 5 months ago
- Unconditional music synthesis using a diffusion model in the STFT domain ☆12 · May 31, 2022 · Updated 3 years ago
- ☆10 · Dec 16, 2022 · Updated 3 years ago
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics … ☆34 · Apr 22, 2024 · Updated last year
- Pitch Estimating Neural Networks (PENN) ☆271 · Apr 2, 2025 · Updated 11 months ago
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation ☆36 · Jan 17, 2024 · Updated 2 years ago
- Fast and differentiable time domain all-pole filter in PyTorch. ☆68 · Feb 5, 2026 · Updated last month
- PyTorch version of Spotify's Basic Pitch ☆49 · Apr 19, 2024 · Updated last year
- LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM ☆18 · May 17, 2024 · Updated last year
- Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions" ☆38 · Oct 28, 2025 · Updated 4 months ago
- ☆87 · Jan 29, 2023 · Updated 3 years ago
- GRAFX: An Open-Source Library for Audio Processing Graphs in PyTorch ☆137 · Feb 3, 2025 · Updated last year
- A DDSP-based neural voice synthesiser. ☆132 · Nov 14, 2024 · Updated last year
- ☆55 · Nov 5, 2024 · Updated last year
- ☆19 · Feb 2, 2023 · Updated 3 years ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks ☆17 · Aug 18, 2023 · Updated 2 years ago
- Differentiable audio signal processors in PyTorch ☆287 · Dec 4, 2023 · Updated 2 years ago
- text to speech ☆10 · Mar 19, 2024 · Updated 2 years ago
- Experiments from the paper "Sinusoidal Frequency Estimation by Gradient Descent" ☆61 · Mar 8, 2023 · Updated 3 years ago
- Encode and decode audio samples to/from continuous and discrete compressed representations! ☆106 · Nov 25, 2025 · Updated 4 months ago
- [EMNLP 2025 Findings] Official code for EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion ☆36 · Sep 9, 2025 · Updated 6 months ago
- PyTorch implementation of "Source Separation by Flow Matching (FLOSS)" by Google DeepMind ☆93 · Nov 24, 2025 · Updated 4 months ago
- ☆31 · Apr 22, 2024 · Updated last year
- Unofficial implementation of ConvNeXt-TTS powered by lightning ☆18 · Oct 20, 2024 · Updated last year
- 60k hours of phoneme-aligned audio from audio books ☆19 · Jul 27, 2024 · Updated last year
- ☆64 · Nov 6, 2023 · Updated 2 years ago
- ☆67 · Aug 16, 2023 · Updated 2 years ago
- GPT-style network for phonemization with durations of text ☆68 · Mar 21, 2024 · Updated 2 years ago
- Prosody and Pronunciation Modification Network ☆63 · May 5, 2025 · Updated 10 months ago
- ☆19 · May 9, 2019 · Updated 6 years ago
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec ☆47 · May 16, 2025 · Updated 10 months ago
- ☆23 · Aug 4, 2025 · Updated 7 months ago
- A differentiable version of SPTK ☆196 · Feb 26, 2026 · Updated last month
- applying audio FX with text descriptors ☆33 · Nov 12, 2025 · Updated 4 months ago
- a Neural Vocoder supporting Ring Attention, Conformer and NSF. ☆24 · Aug 1, 2025 · Updated 7 months ago
- [ICASSP 2025] "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching" ☆109 · Jan 17, 2025 · Updated last year