satvik-dixit / maceLinks
Code for the paper: MACE: Leveraging Audio for Evaluating Audio Captioning Systems
☆13Updated last year
Alternatives and similar repositories for mace
Users that are interested in mace are comparing it to the libraries listed below
Sorting:
- Repo for the IDESSAI 2024 course on modeling audio with discrete tokens.☆13Updated last year
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆15Updated 8 months ago
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆19Updated last week
- music semantic understanding evaluation benchmark☆25Updated 2 years ago
- Official implementation for FlowSep☆69Updated last year
- ☆74Updated last year
- Code implementation for the paper titled MusicLIME: Explainable Multimodal Music Understanding☆23Updated last year
- ☆57Updated last year
- ☆20Updated 10 months ago
- [Batching/MultiGPU/DataLoader Implemented] Code for the paper Hybrid Spectrogram and Waveform Source Separation☆24Updated 2 years ago
- Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions"☆37Updated 3 months ago
- A fast python library for aligning similar audio snippets passed in as NumPy arrays☆48Updated 3 months ago
- A list of datasets made available by members of the Aalto Acoustics Lab☆29Updated last year
- Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""☆14Updated last year
- ☆124Updated 11 months ago
- ☆53Updated last year
- ☆27Updated 6 months ago
- Reimplementation of Bandit for "Remastering Divide and Remaster: A Cinematic Audio Source Separation Dataset with Multilingual Support"☆50Updated 6 months ago
- ☆32Updated last month
- Repository for "Training Audio Captioning Models without Audio"☆10Updated 2 years ago
- ☆15Updated 11 months ago
- Frechet Audio Distance evaluation in PyTorch☆36Updated 2 years ago
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆32Updated 2 years ago
- ☆11Updated last year
- PodcastMix A dataset for separating music and speech in podcasts.☆44Updated last year
- SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning☆51Updated 6 months ago
- Official Repository for ISMIR 2025 paper "Are you really listening? Boosting Perceptual Awareness in Music-QA Benchmarks"☆20Updated 5 months ago
- [ACMMM'2024] Generative Expressive Conversational Speech Synthesis☆43Updated last year
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆51Updated 10 months ago
- Official implementation of "AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and sta…☆49Updated 2 months ago