☆20Mar 12, 2025Updated last year
Alternatives and similar repositories for APT
Users that are interested in APT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆16Sep 14, 2023Updated 2 years ago
- An official repo for the paper "Adapting Language-Audio Models as Few-Shot Audio Learners"☆31May 31, 2023Updated 3 years ago
- Official implementation of the paper - GD-Retriever: Controllable generative text-music retrieval with diffusion models (Accepted at ISMI…☆17Sep 25, 2025Updated 8 months ago
- Official source codes of coco-mulla☆36Mar 21, 2024Updated 2 years ago
- A Framework for Symbolic MUsic Graph Explanations☆11Jul 30, 2025Updated 10 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- A list of resources that can help in research for automated audio captioning☆34Feb 17, 2021Updated 5 years ago
- Code for "CL4AC: A Contrastive Loss for Audio Captioning", DCASE Workshop 2021.☆45Oct 8, 2021Updated 4 years ago
- Open Source code for our paper, Steering Autoregressive Music Generation with Recursive Feature Machines (Zhao et al., 2025). aka MusicRF…☆40Oct 26, 2025Updated 7 months ago
- ☆263Feb 14, 2024Updated 2 years ago
- Pytorch implementation of SoundCTM☆101Mar 31, 2025Updated last year
- The On-the-fly MIDI Augmentation Library!☆32Mar 30, 2025Updated last year
- Repository for "Training Audio Captioning Models without Audio"☆10Sep 26, 2023Updated 2 years ago
- Semi-Supervised Contrastive Learning for music classification - towards HIL-representation learning.☆17Jul 24, 2024Updated last year
- Who calls the shots? Rethinking Few-Shot Learning for Audio (WASPAA 2021)☆43May 24, 2022Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- MiRA (Music Replication Assessment) tool is a model-independent open evaluation method based on four diverse audio music similarity metri…☆36Nov 14, 2025Updated 6 months ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Mar 30, 2021Updated 5 years ago
- A quick introduction to symbolic music processing with partitura☆15Oct 21, 2024Updated last year
- Language modelling for sound event detection☆20Jan 2, 2020Updated 6 years ago
- JEPAs for audio representation learning☆25May 27, 2026Updated last week
- A piano music dataset with Audio, Symbolic and Text labels☆34Mar 6, 2025Updated last year
- The dataset and baseline code for Text-to-Audio Grounding (TAG)☆49Oct 23, 2025Updated 7 months ago
- Code for ICLR 2024 Paper: CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models☆23Jul 10, 2024Updated last year
- MelodyT5: A Unified Score-to-Score Transformer for Symbolic Music Processing [ISMIR 2024]☆47Jan 23, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Metrics for evaluating Automated Audio Captioning systems, designed for PyTorch.☆72Mar 22, 2026Updated 2 months ago
- The official implementation of TokenSynth (ICASSP 2025)☆90Oct 27, 2025Updated 7 months ago
- MuChoMusic is a benchmark for evaluating music understanding in multimodal audio-language models.☆46Dec 3, 2024Updated last year
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆26Mar 27, 2024Updated 2 years ago
- Prosodic features for machine-learning applications, in Matlab.☆15Oct 14, 2025Updated 7 months ago
- Self-Attention Generative Adversarial Network for Speech Enhancement using Tensorflow 2☆16Jan 30, 2021Updated 5 years ago
- Code for the paper: GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities☆153Dec 5, 2024Updated last year
- Code for the paper: "Fine-Tuning Discrete Diffusion Models with Policy Gradient Methods"☆31May 19, 2025Updated last year
- Code for "Audio Retrieval with Natural Language Queries: A Benchmark Study", Transactions on Multimedia 2022☆54Jul 16, 2025Updated 10 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- JamendoMaxCaps is a large-scale dataset of 362,000 instrumental creative commons tracks☆51May 24, 2025Updated last year
- ☆50Aug 27, 2024Updated last year
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆41Jan 6, 2024Updated 2 years ago
- Learning Audio–Sheet Music Correspondences for Cross-Modal Retrieval and Piece Identification☆25Jun 21, 2022Updated 3 years ago
- Statistical analysis of a dataset of 150,000 Wine Reviews☆15Jul 1, 2021Updated 4 years ago
- Source code of the DCASE 2020 SELD submission "Audio Event Detection and Localization with Multitask Regression Network"☆17Jul 8, 2020Updated 5 years ago
- Low-latency timbre transfer models for instrumental interaction.☆101Oct 10, 2025Updated 7 months ago