This is the official implementation for the paper "Pianist Transformer: Towards Expressive Piano Performance Rendering via Scalable Self-Supervised Pre-Training".
☆36Mar 30, 2026Updated 2 months ago
Alternatives and similar repositories for PianistTransformer
Users that are interested in PianistTransformer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Open Source code for our paper, Steering Autoregressive Music Generation with Recursive Feature Machines (Zhao et al., 2025). aka MusicRF…☆40Oct 26, 2025Updated 7 months ago
- Official repository for Aria-MIDI: a MIDI dataset of 1,186,253 transcribed solo-piano recordings.☆93Jun 19, 2025Updated 11 months ago
- PiCoGen (Piano Cover Generation) is an academic project aimed at developing an automatic piano cover generation system.☆49Dec 4, 2025Updated 6 months ago
- "Enhancing Neural Audio Fingerprint Robustness to Audio Degradation for Music Identification" ISMIR2025☆37Sep 11, 2025Updated 8 months ago
- Performance MIDI to Score (PM2S)☆79Oct 10, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- The Basis Mixer is an implementation of the Basis Function Modeling framework for musical expression☆22Apr 19, 2024Updated 2 years ago
- ☆18May 14, 2025Updated last year
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated last year
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆28Nov 7, 2025Updated 7 months ago
- Rate-Adaptive Quantization: A Multi-Rate Codebook Adaptation for Vector Quantization-based Generative Models☆16Sep 10, 2025Updated 8 months ago
- the infinite ramble in rust, powered by tensorflow. (mfcc cosine similarity matching)☆13Apr 30, 2018Updated 8 years ago
- Training, validation, and inference code for various SSL approaches and architectures.☆88Apr 7, 2026Updated 2 months ago
- A codebase for data crawling and preprocessing for TTS and ASR systems training.☆23Feb 26, 2026Updated 3 months ago
- [NeurIPS 2024] Image Understanding Makes for A Good Tokenizer for Image Generation☆22Dec 17, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- The open source code of ALMTokenizer2: Towards Low bit-rate and Semantic-rich Audio Tokenizer with Flow-based Scalar Diffusion Transforme…☆45Sep 5, 2025Updated 9 months ago
- singing voice conversion based on glow-tts☆12Aug 20, 2023Updated 2 years ago
- TheGlueNote is representation model for note-wise music alignment.☆13Jul 19, 2024Updated last year
- This program converts .fits file to .jpg. Fits to jpeg.☆13Jun 4, 2018Updated 8 years ago
- Papernotes about Music Information Retrieval☆32Nov 13, 2019Updated 6 years ago
- Official Repository of Paper: "Towards High-Quality Zero-Shot Singing Voice Conversion in Low-Resource Scenarios"(AAAI 2026)☆102Apr 21, 2026Updated last month
- MelodyT5: A Unified Score-to-Score Transformer for Symbolic Music Processing [ISMIR 2024]☆47Jan 23, 2025Updated last year
- LLMs and VLMs with MLX Swift☆65May 15, 2026Updated 3 weeks ago
- Text-to-text alignment algorithm for speech recognition error analysis.☆30Apr 6, 2026Updated 2 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Music Language Model Generation, Optimization, and Practice☆59Apr 20, 2026Updated last month
- Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation☆62Jun 26, 2025Updated 11 months ago
- ☆72Jun 12, 2025Updated 11 months ago
- 本人的斯坦福CS231n-2024完整作业解决方案☆20Dec 22, 2024Updated last year
- The Audio Score Alignment Test dataset for Ottoman-Turkish makam music☆11Apr 20, 2017Updated 9 years ago
- ☆10Mar 10, 2021Updated 5 years ago
- ☆18Jun 24, 2025Updated 11 months ago
- Bassline generator - Max For Live Midi device☆14Oct 16, 2020Updated 5 years ago
- A JavaScript toolkit for remote net art performance.☆12May 1, 2016Updated 10 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- MelGAN-VC: Voice Conversion and Audio Style Transfer on arbitrarily long samples using Spectrograms☆12Nov 25, 2021Updated 4 years ago
- A song aesthetic evaluation toolkit trained on SongEval.☆307Apr 8, 2026Updated 2 months ago
- An open agentic system built on smolagents, integrating multimodal state-of-the-art music AI models for understanding, generation, and in…☆30Feb 6, 2026Updated 4 months ago
- Mandarin Chinese audio datasets aligned with Montreal Forced Aligner☆19Aug 13, 2024Updated last year
- Run Whisper on audio items directly from REAPER and import the text as text items.☆13Jul 8, 2024Updated last year
- A dataset of pitch curves for music performance assessment☆10Jun 5, 2023Updated 3 years ago
- Compute WER and SER for speech recognition evaluation☆26Mar 18, 2026Updated 2 months ago