hmjz100 / MT3
MT3: a Gradio demo for multi-task multitrack music transcription. (Fully localized into Chinese)
☆12Updated 5 months ago
Alternatives and similar repositories for MT3
Users who are interested in MT3 are comparing it to the libraries listed below
- ☆18Updated last year
- This is a dataset that aligns piano music MIDI with their corresponding textual descriptions and comments. It can be used for multi-modal…☆12Updated last year
- A fork of sinsy: HMM/DNN-based singing voice synthesis system☆68Updated 3 years ago
- fat_llama is a Python package for upscaling audio files to FLAC or WAV formats using advanced audio processing techniques. It utilizes CU…☆22Updated last year
- ☆51Updated 10 months ago
- Official source code of airsep☆38Updated last year
- [DEPRECATED] [PyTorch 2.0] [638M] [85.33% acc] Full-attention multi-instrumental music transformer for supervised music generation, opti…☆33Updated last year
- SOTA Piano Transformer model trained on 4.2GB of Solo Piano MIDI music☆25Updated last year
- A large-scale dataset of caption-annotated MIDI files.☆73Updated last year
- SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning☆48Updated last month
- Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions"☆34Updated last week
- Here we will track the latest Audio AI Agent, including speech, music, sound effects, etc.☆16Updated last year
- A minimum inference engine for DiffSinger☆34Updated last year
- UTAUTAI(Unrestricted Tune Automated Technology Artificial Interigence)☆12Updated last year
- Audio-to-Audio Schrodinger Bridges is a diffusion-based audio restoration model for bandwidth extension and inpainting.☆99Updated last month
- singing voice conversion without f0☆23Updated 2 years ago
- A high-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆31Updated 2 years ago
- Multispeaker Community Vocoder Model for DiffSinger☆37Updated last month
- [DEPRECATED] Multi-Instrumental Music Transformer trained on 12GB/400k MIDIs☆17Updated 2 years ago
- Audio-to-Score Alignment Using Deep Automatic Music Transcription☆42Updated 2 years ago
- Official repository for Aria-MIDI: a MIDI dataset of 1,186,253 transcribed solo-piano recordings.☆60Updated 3 months ago
- Export an ONNX graph that performs ISTFT. Designed for TTS models.☆26Updated last year
- Vocal Remover using Deep Neural Networks☆17Updated 8 months ago
- [AAAI'24] Official dataset & demo code for MID-FiLD: MIDI Dataset for Fine-Level Dynamics☆18Updated last year
- Extension of Sinsy-NG using deep learning models for voice conversion in order to synthesize good and realistic vocals.☆13Updated 5 years ago
- Non Parallel Voice Conversion based on VITS☆24Updated 2 years ago
- ☆11Updated 10 months ago
- PiCoGen (Piano Cover Generation) is an academic project aimed at developing an automatic piano cover generation system.☆38Updated 3 months ago
- Pitch Controllable DDSP Vocoders☆77Updated 10 months ago
- Code for unmixing audio signals into four different stems (drums, bass, vocals, others). The code is adapted from "Jukebox: A Generative Mo…☆36Updated 3 years ago