misya11p / amt-apc
AMT-APC: AMT-APC: Automatic Piano Cover by Fine-Tuning an Automatic Music Transcription Model
☆53Updated last week
Related projects ⓘ
Alternatives and complementary repositories for amt-apc
- A lightweight end-to-end text-to-speech model☆90Updated last month
- ☆62Updated 3 weeks ago
- Nendo is an open source platform for AI-driven audio management, intelligence, and generation.☆117Updated 7 months ago
- We Speech Transcript based on LLM, in 300 lines of code.☆126Updated 2 months ago
- Awesome music generation model——MG²☆101Updated this week
- ☆201Updated 6 months ago
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆73Updated last month
- a text-conditional diffusion probabilistic model capable of generating high fidelity audio.☆124Updated 5 months ago
- Generative models for conditional audio generation☆117Updated 2 months ago
- Ultimate Vocal Remover 5 with Gradio UI. Separate an audio file into various stems, using multiple models☆203Updated this week
- Fine-tune Stable Audio Open with DiT ControlNet.☆176Updated 2 months ago
- ☆171Updated 11 months ago
- A toolkit for speaker diarization.☆140Updated 3 weeks ago
- VC Without Retrain!☆102Updated 6 months ago
- LlamaVoice is a llama-based large voice generation model, providing inference and training ability.☆219Updated 2 months ago
- High-quality Text-to-Audio Generation with Efficient Diffusion Transformer☆234Updated 3 weeks ago
- Music remixer based on MusicGen-Chord☆85Updated 9 months ago
- ☆77Updated 2 months ago
- Barkify: an unoffical training implementation of Bark TTS by suno-ai☆126Updated last year
- Running the F5-TTS by ONNX Runtime☆27Updated last week
- ☆65Updated 11 months ago
- ☆65Updated 3 weeks ago
- Text-to-Music Generation with Rectified Flow Transformer☆45Updated 2 months ago
- Sing an idea ➡️ AI music sample🔥🎶☆90Updated 6 months ago
- ☆115Updated last month
- API for a Vocal Remover that uses Deep Neural Networks.☆85Updated 4 months ago
- Metrics for evaluating music and audio generative models – with a focus on long-form, full-band, and stereo generations.☆150Updated 3 months ago
- Chord conditioning implemented MusicGen☆43Updated 7 months ago
- Uses deepgram/whisper/custom models to create an LJSpeech dataset for voice model fine tuning☆12Updated this week
- An Open-Sourced LLM-empowered Foundation TTS System☆424Updated 3 weeks ago