satvik-dixit / maceLinks
Code for the paper: MACE: Leveraging Audio for Evaluating Audio Captioning Systems
☆12Updated 10 months ago
Alternatives and similar repositories for mace
Users that are interested in mace are comparing it to the libraries listed below
Sorting:
- Code implementation for the paper titled MusicLIME: Explainable Multimodal Music Understanding☆23Updated 9 months ago
- ☆51Updated last year
- ☆71Updated last year
- ☆14Updated 9 months ago
- ☆54Updated last year
- A list of datasets made available by members of the Aalto Acoustics Lab☆29Updated last year
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆19Updated last week
- [ICASSP'24] Investigating Personalization Methods in Text to Music Generation☆43Updated last year
- Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""☆14Updated last year
- ☆119Updated 9 months ago
- Official implementation of "AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and sta…☆44Updated last week
- [Batching/MultiGPU/DataLoader Implemented] Code for the paper Hybrid Spectrogram and Waveform Source Separation☆22Updated 2 years ago
- ☆31Updated 7 months ago
- Official implementation for FlowSep☆68Updated 10 months ago
- Reimplementation of Bandit for "Remastering Divide and Remaster: A Cinematic Audio Source Separation Dataset with Multilingual Support"☆47Updated 3 months ago
- CLaMP 2: Multimodal Music Information Retrieval Across 101 Languages Using Large Language Models [NAACL 2025]☆58Updated 8 months ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆15Updated 6 months ago
- Real-time end-to-end singing voice convertion☆22Updated last year
- PodcastMix A dataset for separating music and speech in podcasts.☆44Updated last year
- Official Repository for ISMIR 2025 paper "Are you really listening? Boosting Perceptual Awareness in Music-QA Benchmarks"☆21Updated 3 months ago
- Official repository for Aria-MIDI: a MIDI dataset of 1,186,253 transcribed solo-piano recordings.☆67Updated 5 months ago
- A fast python library for aligning similar audio snippets passed in as NumPy arrays☆48Updated 3 weeks ago
- Audio-to-Audio Schrodinger Bridges is a diffusion-based audio restoration model for bandwidth extension and inpainting.☆115Updated 3 months ago
- SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning☆48Updated 3 months ago
- The Multi-band Excited WaveNet☆15Updated 2 years ago
- ☆20Updated 8 months ago
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆50Updated 8 months ago
- Official repository for the paper - SLAP: Siamese Language-Audio Pretraining without negative samples for Music Understanding☆52Updated last month
- Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions"☆35Updated 3 weeks ago
- ☆23Updated 3 months ago