haoheliu / audioldm_evalView external linksLinks
This toolbox aims to unify audio generation model evaluation for easier comparison.
☆375Sep 29, 2024Updated last year
Alternatives and similar repositories for audioldm_eval
Users that are interested in audioldm_eval are comparing it to the libraries listed below
Sorting:
- AudioLDM training, finetuning, evaluation and inference.☆295Dec 13, 2024Updated last year
- AudioLDM: Generate speech, sound effects, music and beyond, with text.☆2,825Jun 25, 2025Updated 7 months ago
- This reporsitory contains metadata of WavCaps dataset and codes for downstream tasks.☆256Jul 25, 2024Updated last year
- Contrastive Language-Audio Pretraining☆2,027May 15, 2025Updated 8 months ago
- Audio Dataset for training CLAP and other models☆729Jan 8, 2026Updated last month
- The Open Source Code of UniAudio☆598Jul 22, 2024Updated last year
- A lightweight library for Frechet Audio Distance calculation.☆308Updated this week
- Learning audio concepts from natural language supervision☆640Sep 18, 2024Updated last year
- The source code of our paper "Diffsound: discrete diffusion model for text-to-sound generation"☆365Aug 3, 2023Updated 2 years ago
- Models and code for RepCodec: A Speech Representation Codec for Speech Tokenization☆191Jul 12, 2024Updated last year
- VoiceLDM: Text-to-Speech with Environmental Context☆192Aug 9, 2024Updated last year
- Implementation of Frieren: Efficient Video-to-Audio Generation Network with Rectified Flow Matching (NeurIPS'24)☆59Apr 3, 2025Updated 10 months ago
- A 6-million Audio-Caption Paired Dataset Built with a LLMs and ALMs-based Automatic Pipeline☆195Dec 13, 2024Updated last year
- Audio generation using diffusion models, in PyTorch.☆2,096Jun 12, 2023Updated 2 years ago
- This repo hosts the code and models of "Masked Autoencoders that Listen".