SonyCSLParis/pesto-full

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/SonyCSLParis/pesto-full)

SonyCSLParis / pesto-full

Full models and training code for PESTO

☆83

Alternatives and similar repositories for pesto-full

Users that are interested in pesto-full are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

SonyCSLParis / pesto
View on GitHub
Self-supervised learning for real-time pitch estimation
☆297Oct 15, 2025Updated 9 months ago
maxrmorrison / torbi
View on GitHub
Viterbi decoding in PyTorch
☆42May 5, 2026Updated 2 months ago
PierreChouteau / umss_icassp
View on GitHub
ICASSP 2024 paper - A Fully Differentiable Model for Unsupervised Singing Voice Separation
☆14Mar 7, 2025Updated last year
cwitkowitz / ss-mpe
View on GitHub
Code for the paper "Toward Fully Self-Supervised Multi-Pitch Estimation".
☆25Sep 27, 2025Updated 10 months ago
eloimoliner / unconditional-diff-STFT
View on GitHub
Unconditional music synthesis using a diffusion model in the STFT domain
☆12May 31, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
muthissar / diffstm
View on GitHub
☆10Dec 16, 2022Updated 3 years ago
york135 / MIRMLPop
View on GitHub
The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …
☆35Apr 22, 2024Updated 2 years ago
interactiveaudiolab / penn
View on GitHub
Pitch Estimating Neural Networks (PENN)
☆278Apr 2, 2025Updated last year
gudgud96 / basic-pitch-torch
View on GitHub
PyTorch version of Spotify's Basic Pitch
☆54Apr 19, 2024Updated 2 years ago
asuni / PitchSqueezer
View on GitHub
A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation
☆38Jan 17, 2024Updated 2 years ago
archinetai / cqt-pytorch
View on GitHub
An invertible and differentiable implementation of the Constant-Q Transform (CQT).
☆73Dec 9, 2022Updated 3 years ago
SonyCSLParis / codicodec
View on GitHub
Encode and decode audio samples to/from continuous and discrete compressed representations!
☆121Nov 25, 2025Updated 8 months ago
DiffAPF / torchlpc
View on GitHub
Fast and differentiable time domain all-pole filter in PyTorch.
☆72Feb 5, 2026Updated 5 months ago
huutuongtu / Lightvoc
View on GitHub
LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM
☆18May 17, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
christhetree / mod_discovery
View on GitHub
Source code for "Modulation Discovery with Differentiable Digital Signal Processing".
☆15Mar 25, 2026Updated 4 months ago
sh-lee97 / grafx
View on GitHub
GRAFX: An Open-Source Library for Audio Processing Graphs in PyTorch
☆139Jun 29, 2026Updated 3 weeks ago
yoyolicoris / music-spectrogram-diffusion-pytorch
View on GitHub
☆88Jan 29, 2023Updated 3 years ago
iamycy / golf
View on GitHub
A DDSP-based neural voice synthesiser.
☆135Nov 14, 2024Updated last year
sony / diffusion-timbre-transfer
View on GitHub
☆56Nov 5, 2024Updated last year
CODEJIN / XiaoiceSing2
View on GitHub
☆19Feb 2, 2023Updated 3 years ago
CameronChurchwell / combnet
View on GitHub
☆23Aug 4, 2025Updated 11 months ago
audiodemo / voice-conversion
View on GitHub
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks
☆17Aug 18, 2023Updated 2 years ago
Dream-High / DJCM
View on GitHub
☆30Apr 22, 2024Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
shengcanxu / canoSpeech
View on GitHub
text to speech
☆10Mar 19, 2024Updated 2 years ago
Victorletzelter / LoRA-MCL
View on GitHub
Multiple Choice Learning of Low Rank Adapters for Language Modeling
☆16Feb 26, 2026Updated 5 months ago
ben-hayes / sinusoidal-gradient-descent
View on GitHub
Experiments from the paper "Sinusoidal Frequency Estimation by Gradient Descent"
☆61Mar 8, 2023Updated 3 years ago
csteinmetz1 / dasp-pytorch
View on GitHub
Differentiable audio signal processors in PyTorch
☆298Dec 4, 2023Updated 2 years ago
ex3ndr / supervoice-librilight-preprocessed
View on GitHub
60k hours of phoneme-aligned audio from audio books
☆19Jul 27, 2024Updated 2 years ago
SonyResearch / diffvox
View on GitHub
Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions"
☆40Oct 28, 2025Updated 9 months ago
sony / diffiner
View on GitHub
☆68Aug 16, 2023Updated 2 years ago
ex3ndr / supervoice-gpt
View on GitHub
GPT-style network for phonemization with durations of text
☆68Mar 21, 2024Updated 2 years ago
reppy4620 / convnext_tts
View on GitHub
Unofficial implementation of ConvNeXt-TTS powered by lightning
☆18Oct 20, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
SonyCSLParis / ssl-singer-identity
View on GitHub
☆69Nov 6, 2023Updated 2 years ago
maxrmorrison / promonet
View on GitHub
Prosody and Pronunciation Modification Network
☆64May 5, 2025Updated last year
XiaoyuBIE1994 / SDCodec
View on GitHub
(ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec
☆48May 16, 2025Updated last year
SonyResearch / ITO-Master
View on GitHub
Implementation of the paper "ITO-Master: Inference-Time Optimization for Audio Effects Modeling of Music Mastering Processors"
☆27Jul 3, 2025Updated last year
sp-nitech / diffsptk
View on GitHub
A differentiable version of SPTK
☆201Jul 14, 2026Updated 2 weeks ago
uthree / ddsp-vocoder
View on GitHub
☆12Nov 7, 2024Updated last year
SonyCSLParis / audio-metrics
View on GitHub
Compute distribution-based quality metrics for audio data using embeddings, with a focus on music.
☆47Jan 15, 2026Updated 6 months ago