ghost-signal / myna
Official repository of Myna: Masking-Based Contrastive Learning of Musical Representations
☆17Updated 9 months ago
Alternatives and similar repositories for myna
Users that are interested in myna are comparing it to the libraries listed below
- Video Background Music Generation Using Unpaired Audio-Visual Data☆30Updated last year
- Official implementation for FlowSep☆68Updated last year
- Codebase and project page for EDMSound☆35Updated 2 years ago
- ☆45Updated last year
- Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions"☆37Updated 2 months ago
- JamendoMaxCaps is a large-scale dataset of 362,000 instrumental creative commons tracks☆45Updated 7 months ago
- Variable Bitrate Residual Vector Quantization for Audio Coding☆50Updated 8 months ago
- LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation (INTERSPEECH 2024)☆43Updated last year
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.☆113Updated last year
- Implementation of Multi-Source Music Generation with Latent Diffusion.☆27Updated last year
- MeanAudio: Fast and Faithful Text-to-Audio Generation with Mean Flows☆118Updated 4 months ago
- This is the repository for the work "BridgeVoC: Revitalizing Neural Vocoder from a Restoration Perspective".☆61Updated 2 months ago
- The source code for the paper XiaoiceSing2 (Interspeech 2023)☆49Updated 2 years ago
- music semantic understanding evaluation benchmark☆25Updated 2 years ago
- FINALLY: Fast and universal speech enhancement model delivering studio-quality audio for a wide range of recordings.☆25Updated last month
- Elucidated Text-To-Audio (ETTA) is a SOTA text-to-audio model with a holistic understanding of the design space and trained with syntheti…☆95Updated 3 months ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆38Updated 2 years ago
- Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986☆48Updated last year
- PyTorch implementation of "Source Separation by Flow Matching (FLOSS)" by Google DeepMind☆89Updated last month
- Findings of ACL 2023 | AlignSTS: a speech-to-singing (STS) model based on modality disentanglement and cross-modal alignment☆68Updated last year
- ☆43Updated last year
- [InterSpeech'2024] FluentEditor:Text-based Speech Editing by Considering Acoustic and Prosody Consistency☆59Updated last year
- Adaptive Vocoder for Custom Voice☆61Updated 3 years ago
- A repo that builds text-to-music datasets from scratch, used in MuseControlLite [ICML2025]☆27Updated 7 months ago
- This repo is text to speech with learnable audio encoder without alignment with transcript reference☆48Updated 3 months ago
- [ACMMM'2024] Generative Expressive Conversational Speech Synthesis☆43Updated last year
- ☆52Updated 6 months ago
- ☆18Updated 8 months ago
- Textless Speech-to-Music Retrieval Using Emotion Similarity [ICASSP23]☆17Updated 2 years ago
- SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning☆51Updated 5 months ago