frankenliu / LOAEView external linksLinks
☆11Sep 25, 2024Updated last year
Alternatives and similar repositories for LOAE
Users that are interested in LOAE are comparing it to the libraries listed below
Sorting:
- Repository for "Training Audio Captioning Models without Audio"☆10Sep 26, 2023Updated 2 years ago
- Audio Entailment: Deductive Reasoning for Audio Understanding☆17Dec 10, 2024Updated last year
- A speech signal processing library in Python with emphasis on deep learning.☆31Jul 16, 2022Updated 3 years ago
- ☆34Jun 9, 2025Updated 8 months ago
- Colab notebook for fine-tuning Qwen2-Audio with trl's SFT and PPO trainers.☆24Nov 23, 2024Updated last year
- official implementation of MGA-CLAP (ACM MM 2024)☆28Oct 25, 2024Updated last year
- ☆22Mar 19, 2025Updated 10 months ago
- Fluency ENhanced Sentence-bert Evaluation (FENSE), metric for audio caption evaluation. And Benchmark dataset AudioCaps-Eval, Clotho-Eval…☆21Feb 1, 2023Updated 3 years ago
- ☆50Aug 27, 2024Updated last year
- music semantic understanding evaluation benchmark☆25Aug 12, 2023Updated 2 years ago
- [ASRU 2025] Omni-R1: Do You Really Need Audio to Fine-Tune Your Audio LLM?☆42Nov 21, 2025Updated 2 months ago
- Open-Ended Speaking Style Modeling via Fine-Grained and Multi-Granular Contrastive Language-Speech Pre-training☆62Feb 7, 2026Updated last week
- Official Implementation of EnCLAP (ICASSP 2024)☆94Jun 2, 2024Updated last year
- Source code for Consistent ensemble distillation for audio tagging☆56Jun 12, 2025Updated 8 months ago
- [AAAI 2024] V2A-Mapper: A Lightweight Solution for Vision-to-Audio Generation by Connecting Foundation Models☆27Dec 14, 2023Updated 2 years ago
- Official Implementation of "Prefix tuning for Automated Audio Captioning(ICASSP 2023)"☆31Dec 6, 2023Updated 2 years ago
- ☆23Sep 10, 2025Updated 5 months ago
- Official Repository of IJCAI 2024 Paper: "BATON: Aligning Text-to-Audio Model with Human Preference Feedback"☆32Mar 4, 2025Updated 11 months ago
- ConsistencyTTA: Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation☆38Nov 20, 2024Updated last year
- [INTERSPEECH 2025 Oral]Official code for "Accelerating Diffusion-based Text-to-Speech Model Training with Dual Modality Alignment"☆64Jun 16, 2025Updated 7 months ago
- A list of resources that can help in research for automated audio captioning☆34Feb 17, 2021Updated 4 years ago
- [NeurIPS 2025] Separate Anything in Audio with Zero Training☆53Nov 3, 2025Updated 3 months ago
- VoxInstruct: Expressive Human Instruction-to-Speech Generation with Unified Multilingual Codec Language Modelling☆96Nov 9, 2024Updated last year
- A list of papers about audio captioning☆79Jul 1, 2022Updated 3 years ago
- Official PyTorch implementation of "Conditional Generation of Audio from Video via Foley Analogies".☆93Dec 8, 2023Updated 2 years ago
- ☆37Jul 4, 2024Updated last year
- Official Implementation of GLAP - General Language Audio Pretraining☆61Jan 5, 2026Updated last month
- In Divisive we have all points in one cluster initially and we break the cluster into required number of clusters.☆10May 19, 2018Updated 7 years ago
- Code for "CL4AC: A Contrastive Loss for Audio Captioning", DCASE Workshop 2021.☆45Oct 8, 2021Updated 4 years ago
- The dataset and baseline code for Text-to-Audio Grounding (TAG)☆50Oct 23, 2025Updated 3 months ago
- An evolutionary algorithm that generates an accompaniment to a given melody that consists of triad chords while following music theory ru…☆10Sep 19, 2022Updated 3 years ago
- offical code for Dense-TSNet☆12Sep 17, 2024Updated last year
- ☆11Dec 28, 2023Updated 2 years ago
- Audio captioning recipe☆51Oct 23, 2025Updated 3 months ago
- ☆13Jun 2, 2022Updated 3 years ago
- ☆12Jun 1, 2024Updated last year
- A Framework for Symbolic MUsic Graph Explanations☆10Jul 30, 2025Updated 6 months ago
- An implementation of the self-tuning spectral clustering algorithm described in Zelnik-Manor and Perona (2004)☆12Mar 27, 2018Updated 7 years ago
- ☆11Apr 30, 2025Updated 9 months ago