SonyCSLParis / pesto
Self-supervised learning for fast pitch estimation
☆171Updated last month
Related projects: ⓘ
- Pitch Estimating Neural Networks (PENN)☆227Updated last month
- A DDSP-based neural voice synthesiser.☆95Updated last week
- A simple library for Fréchet Audio Distance (FAD) calculation☆137Updated last week
- Encode and decode audio samples to/from compressed latent representations!☆119Updated last month
- Codes for ISMIR 2022 paper: Beat Transformer: Demixed Beat and Downbeat Tracking with Dilated Self-Attention☆88Updated 5 months ago
- AQUA-Tk = Audio QUality Assessment-Toolkit. (In development)☆93Updated 2 weeks ago
- ☆71Updated last year
- Object-oriented handling of audio data, with GPU-powered augmentations, and more.☆218Updated last month
- Unofficial download repository for MusicCaps☆41Updated last year
- GOMIN; Gaudio Open Mel-spectrogram Inversion Network☆109Updated 8 months ago
- Audiogen Codec☆116Updated 2 months ago
- ☆158Updated 7 months ago
- ☆154Updated 10 months ago
- Full models and training code for PESTO☆48Updated 3 months ago
- Official implementation of SawSing (ISMIR'22)☆250Updated 2 years ago
- Pitch-shifting, time-stretching, and vocoding of speech with Controllable LPCNet (CLPCNet)☆147Updated 2 years ago
- A collection of useful audio datasets and transforms for PyTorch.☆130Updated last year
- Models and datasets for training deep learning automatic mixing models☆92Updated 3 weeks ago
- Pytorch implementation of automatic music transcription method that uses a two-level hierarchical frequency-time Transformer architecture…☆70Updated last year
- GRAFX: An Open-Source Library for Audio Processing Graphs in PyTorch☆81Updated this week
- Moises Source Separation Public Dataset☆106Updated 9 months ago
- Sync Toolbox - Python package with reference implementations for efficient, robust, and accurate music synchronization based on dynamic t…☆108Updated 9 months ago
- Expressive Anechoic Recordings of Speech (EARS)☆123Updated 2 months ago
- ☆26Updated 10 months ago
- Unsupervised Music Source Separation Using Differentiable Parametric Source Models☆58Updated last year
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆81Updated last month
- Sound Demixing Challenge 2023☆70Updated last year
- Toward Universal Text-to-Music-Retrieval (TTMR) [ICASSP23]☆111Updated last year
- open soundstream-ish VAE codecs for downstream neural audio synthesis☆109Updated last year
- Headless multitrack mixing console in Python☆114Updated last year