This is the official implementation for the paper "Pianist Transformer: Towards Expressive Piano Performance Rendering via Scalable Self-Supervised Pre-Training".
☆29Feb 8, 2026Updated last month
Alternatives and similar repositories for PianistTransformer
Users that are interested in PianistTransformer are comparing it to the libraries listed below
Sorting:
- Open Source code for our paper, Steering Autoregressive Music Generation with Recursive Feature Machines (Zhao et al., 2025). aka MusicRF…☆38Oct 26, 2025Updated 4 months ago
- PiCoGen (Piano Cover Generation) is an academic project aimed at developing an automatic piano cover generation system.☆49Dec 4, 2025Updated 3 months ago
- "Enhancing Neural Audio Fingerprint Robustness to Audio Degradation for Music Identification" ISMIR2025☆35Sep 11, 2025Updated 6 months ago
- Official repository for Aria-MIDI: a MIDI dataset of 1,186,253 transcribed solo-piano recordings.☆78Jun 19, 2025Updated 9 months ago
- ☆18May 14, 2025Updated 10 months ago
- Performance MIDI to Score (PM2S)☆75Oct 10, 2024Updated last year
- Rate-Adaptive Quantization: A Multi-Rate Codebook Adaptation for Vector Quantization-based Generative Models☆15Sep 10, 2025Updated 6 months ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 11 months ago
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆28Nov 7, 2025Updated 4 months ago
- Training, validation, and inference code for various SSL approaches and architectures.☆80Oct 22, 2025Updated 4 months ago
- Official Repository of Paper: "Towards High-Quality Zero-Shot Singing Voice Conversion in Low-Resource Scenarios"(AAAI 2026)☆91Jan 31, 2026Updated last month
- A codebase for data crawling and preprocessing for TTS and ASR systems training.☆22Feb 26, 2026Updated 3 weeks ago
- [NeurIPS 2024] Image Understanding Makes for A Good Tokenizer for Image Generation☆22Dec 17, 2024Updated last year
- singing voice conversion based on glow-tts☆12Aug 20, 2023Updated 2 years ago
- TheGlueNote is representation model for note-wise music alignment.☆12Jul 19, 2024Updated last year
- This program converts .fits file to .jpg. Fits to jpeg.☆13Jun 4, 2018Updated 7 years ago
- Papernotes about Music Information Retrieval☆32Nov 13, 2019Updated 6 years ago
- Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation☆62Jun 26, 2025Updated 8 months ago
- MelodyT5: A Unified Score-to-Score Transformer for Symbolic Music Processing [ISMIR 2024]☆46Jan 23, 2025Updated last year
- ☆71Jun 12, 2025Updated 9 months ago
- Bassline generator - Max For Live Midi device☆14Oct 16, 2020Updated 5 years ago
- MelGAN-VC: Voice Conversion and Audio Style Transfer on arbitrarily long samples using Spectrograms☆11Nov 25, 2021Updated 4 years ago
- The Audio Score Alignment Test dataset for Ottoman-Turkish makam music☆11Apr 20, 2017Updated 8 years ago
- ☆10Mar 10, 2021Updated 5 years ago
- ☆17Jun 24, 2025Updated 8 months ago
- Run Whisper on audio items directly from REAPER and import the text as text items.☆13Jul 8, 2024Updated last year
- A JavaScript toolkit for remote net art performance.☆12May 1, 2016Updated 9 years ago
- A song aesthetic evaluation toolkit trained on SongEval.☆288Jun 15, 2025Updated 9 months ago
- A port of Runebender from Druid to Xilem☆46Updated this week
- Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder☆32Aug 30, 2025Updated 6 months ago
- MRSAudio: A Large-Scale Multimodal Recorded Spatial Audio Dataset with Refined Annotations☆34Oct 15, 2025Updated 5 months ago
- Mandarin Chinese audio datasets aligned with Montreal Forced Aligner☆17Aug 13, 2024Updated last year
- Compute WER and SER for speech recognition evaluation☆27Updated this week
- A dataset of pitch curves for music performance assessment☆10Jun 5, 2023Updated 2 years ago
- A set of corpus-based sampling & analysis M4L devices☆11Oct 29, 2025Updated 4 months ago
- MAX/MSP objects for audio and rhythmic synthesis using networks of coupled oscillators☆13May 5, 2023Updated 2 years ago
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …☆23Mar 13, 2026Updated last week
- Utu is a command-line program that uses the Loris library to analyze sounds.☆16Oct 11, 2022Updated 3 years ago
- query by humming system☆19Aug 7, 2015Updated 10 years ago