This is the official implementation for the paper "Pianist Transformer: Towards Expressive Piano Performance Rendering via Scalable Self-Supervised Pre-Training".
☆36 · Mar 30, 2026 · Updated 3 weeks ago
Alternatives and similar repositories for PianistTransformer
Users who are interested in PianistTransformer are comparing it to the libraries listed below.
- Open Source code for our paper, Steering Autoregressive Music Generation with Recursive Feature Machines (Zhao et al., 2025). aka MusicRF… ☆38 · Oct 26, 2025 · Updated 6 months ago
- Official repository for Aria-MIDI: a MIDI dataset of 1,186,253 transcribed solo-piano recordings. ☆86 · Jun 19, 2025 · Updated 10 months ago
- PiCoGen (Piano Cover Generation) is an academic project aimed at developing an automatic piano cover generation system. ☆49 · Dec 4, 2025 · Updated 4 months ago
- "Enhancing Neural Audio Fingerprint Robustness to Audio Degradation for Music Identification" ISMIR2025 ☆35 · Sep 11, 2025 · Updated 7 months ago
- Performance MIDI to Score (PM2S) ☆78 · Oct 10, 2024 · Updated last year
- The Basis Mixer is an implementation of the Basis Function Modeling framework for musical expression ☆21 · Apr 19, 2024 · Updated 2 years ago
- ☆18 · May 14, 2025 · Updated 11 months ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding" ☆11 · Apr 10, 2025 · Updated last year
- Rate-Adaptive Quantization: A Multi-Rate Codebook Adaptation for Vector Quantization-based Generative Models ☆16 · Sep 10, 2025 · Updated 7 months ago
- the infinite ramble in rust, powered by tensorflow. (mfcc cosine similarity matching) ☆14 · Apr 30, 2018 · Updated 7 years ago
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech … ☆28 · Nov 7, 2025 · Updated 5 months ago
- Training, validation, and inference code for various SSL approaches and architectures. ☆85 · Apr 7, 2026 · Updated 3 weeks ago
- A codebase for data crawling and preprocessing for TTS and ASR systems training. ☆23 · Feb 26, 2026 · Updated 2 months ago
- [NeurIPS 2024] Image Understanding Makes for A Good Tokenizer for Image Generation ☆22 · Dec 17, 2024 · Updated last year
- The open source code of ALMTokenizer2: Towards Low bit-rate and Semantic-rich Audio Tokenizer with Flow-based Scalar Diffusion Transforme… ☆45 · Sep 5, 2025 · Updated 7 months ago
- singing voice conversion based on glow-tts ☆12 · Aug 20, 2023 · Updated 2 years ago
- Music Language Model Generation, Optimization, and Practice ☆55 · Apr 20, 2026 · Updated last week
- This program converts .fits files to .jpg. Fits to jpeg. ☆13 · Jun 4, 2018 · Updated 7 years ago
- TheGlueNote is a representation model for note-wise music alignment. ☆12 · Jul 19, 2024 · Updated last year
- Papernotes about Music Information Retrieval ☆32 · Nov 13, 2019 · Updated 6 years ago
- Official Repository of Paper: "Towards High-Quality Zero-Shot Singing Voice Conversion in Low-Resource Scenarios" (AAAI 2026) ☆98 · Apr 21, 2026 · Updated last week
- MelodyT5: A Unified Score-to-Score Transformer for Symbolic Music Processing [ISMIR 2024] ☆47 · Jan 23, 2025 · Updated last year
- Text-to-text alignment algorithm for speech recognition error analysis. ☆29 · Apr 6, 2026 · Updated 3 weeks ago
- Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation ☆62 · Jun 26, 2025 · Updated 10 months ago
- My complete assignment solutions for Stanford CS231n-2024 ☆21 · Dec 22, 2024 · Updated last year
- ☆72 · Jun 12, 2025 · Updated 10 months ago
- The Audio Score Alignment Test dataset for Ottoman-Turkish makam music ☆11 · Apr 20, 2017 · Updated 9 years ago
- ☆10 · Mar 10, 2021 · Updated 5 years ago
- Bassline generator - Max For Live Midi device ☆14 · Oct 16, 2020 · Updated 5 years ago
- ☆18 · Jun 24, 2025 · Updated 10 months ago
- MelGAN-VC: Voice Conversion and Audio Style Transfer on arbitrarily long samples using Spectrograms ☆12 · Nov 25, 2021 · Updated 4 years ago
- A JavaScript toolkit for remote net art performance. ☆12 · May 1, 2016 · Updated 9 years ago
- A song aesthetic evaluation toolkit trained on SongEval. ☆301 · Apr 8, 2026 · Updated 3 weeks ago
- Run Whisper on audio items directly from REAPER and import the text as text items. ☆13 · Jul 8, 2024 · Updated last year
- Mandarin Chinese audio datasets aligned with Montreal Forced Aligner ☆19 · Aug 13, 2024 · Updated last year
- A dataset of pitch curves for music performance assessment ☆10 · Jun 5, 2023 · Updated 2 years ago
- Compute WER and SER for speech recognition evaluation ☆27 · Mar 18, 2026 · Updated last month
- Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder ☆36 · Aug 30, 2025 · Updated 8 months ago
- Devices for Ableton & Max for Live ☆10 · Feb 1, 2026 · Updated 2 months ago