☆20 Mar 12, 2025 Updated last year
Alternatives and similar repositories for APT
Users who are interested in APT are comparing it to the libraries listed below.
- ☆16 Sep 14, 2023 Updated 2 years ago
- An official repo for the paper "Adapting Language-Audio Models as Few-Shot Audio Learners" ☆31 May 31, 2023 Updated 2 years ago
- Official implementation of the paper - GD-Retriever: Controllable generative text-music retrieval with diffusion models (Accepted at ISMI… ☆17 Sep 25, 2025 Updated 6 months ago
- Official source codes of coco-mulla ☆36 Mar 21, 2024 Updated 2 years ago
- A Framework for Symbolic MUsic Graph Explanations ☆10 Jul 30, 2025 Updated 7 months ago
- A list of resources that can help in research for automated audio captioning ☆34 Feb 17, 2021 Updated 5 years ago
- Code for "CL4AC: A Contrastive Loss for Audio Captioning", DCASE Workshop 2021. ☆45 Oct 8, 2021 Updated 4 years ago
- Open Source code for our paper, Steering Autoregressive Music Generation with Recursive Feature Machines (Zhao et al., 2025). aka MusicRF… ☆38 Oct 26, 2025 Updated 5 months ago
- ☆252 Feb 14, 2024 Updated 2 years ago
- Pytorch implementation of SoundCTM ☆101 Mar 31, 2025 Updated 11 months ago
- The On-the-fly MIDI Augmentation Library! ☆32 Mar 30, 2025 Updated 11 months ago
- Repository for "Training Audio Captioning Models without Audio" ☆10 Sep 26, 2023 Updated 2 years ago
- Semi-Supervised Contrastive Learning for music classification - towards HIL-representation learning. ☆17 Jul 24, 2024 Updated last year
- Who calls the shots? Rethinking Few-Shot Learning for Audio (WASPAA 2021) ☆43 May 24, 2022 Updated 3 years ago
- MiRA (Music Replication Assessment) tool is a model-independent open evaluation method based on four diverse audio music similarity metri… ☆34 Nov 14, 2025 Updated 4 months ago
- JEPAs for audio representation learning ☆19 Jun 22, 2025 Updated 9 months ago
- Using YouTube to prepare a speech recognition dataset for any language ☆10 Mar 30, 2021 Updated 4 years ago
- [ICMR 2025] Official Repository for The Paper, Let Network Decide What to Learn: Symbolic Music Understanding Model Based on Large-scale … ☆18 Aug 17, 2025 Updated 7 months ago
- Language modelling for sound event detection ☆20 Jan 2, 2020 Updated 6 years ago
- A quick introduction to symbolic music processing with partitura ☆15 Oct 21, 2024 Updated last year
- A piano music dataset with Audio, Symbolic and Text labels ☆34 Mar 6, 2025 Updated last year
- The dataset and baseline code for Text-to-Audio Grounding (TAG) ☆50 Oct 23, 2025 Updated 5 months ago
- Code for ICLR 2024 Paper: CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models ☆22 Jul 10, 2024 Updated last year
- MelodyT5: A Unified Score-to-Score Transformer for Symbolic Music Processing [ISMIR 2024] ☆46 Jan 23, 2025 Updated last year
- The official implementation of TokenSynth (ICASSP 2025) ☆81 Oct 27, 2025 Updated 5 months ago
- Metrics for evaluating Automated Audio Captioning systems, designed for PyTorch. ☆70 Mar 22, 2026 Updated last week
- MuChoMusic is a benchmark for evaluating music understanding in multimodal audio-language models. ☆44 Dec 3, 2024 Updated last year
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation" ☆26 Mar 27, 2024 Updated 2 years ago
- JamendoMaxCaps is a large-scale dataset of 362,000 instrumental creative commons tracks ☆47 May 24, 2025 Updated 10 months ago
- Prosodic features for machine-learning applications, in Matlab. ☆15 Oct 14, 2025 Updated 5 months ago
- Self-Attention Generative Adversarial Network for Speech Enhancement using Tensorflow 2 ☆16 Jan 30, 2021 Updated 5 years ago
- Code for the paper: GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities ☆154 Dec 5, 2024 Updated last year
- Code for the paper: "Fine-Tuning Discrete Diffusion Models with Policy Gradient Methods" ☆31 May 19, 2025 Updated 10 months ago
- Implementation of "Audio Retrieval with Natural Language Queries: A Benchmark Study". ☆54 Jul 16, 2025 Updated 8 months ago
- ☆50 Aug 27, 2024 Updated last year
- An implementation of Jasper, QuartzNet, Citrinet and pipeline for training CTC-based ASR models ☆12 Nov 13, 2021 Updated 4 years ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv… ☆39 Jan 6, 2024 Updated 2 years ago
- Learning Audio–Sheet Music Correspondences for Cross-Modal Retrieval and Piece Identification ☆25 Jun 21, 2022 Updated 3 years ago
- Statistical analysis of a dataset of 150,000 Wine Reviews ☆15 Jul 1, 2021 Updated 4 years ago