satvik-dixit / maceLinks
Code for the paper: MACE: Leveraging Audio for Evaluating Audio Captioning Systems
☆13Updated 10 months ago
Alternatives and similar repositories for mace
Users that are interested in mace are comparing it to the libraries listed below
Sorting:
- ☆72Updated last year
- Official implementation for FlowSep☆68Updated 11 months ago
- ☆25Updated 4 months ago
- [ICASSP'24] Investigating Personalization Methods in Text to Music Generation☆44Updated last year
- PodcastMix A dataset for separating music and speech in podcasts.☆44Updated last year
- Reimplementation of Bandit for "Remastering Divide and Remaster: A Cinematic Audio Source Separation Dataset with Multilingual Support"☆48Updated 4 months ago
- [Batching/MultiGPU/DataLoader Implemented] Code for the paper Hybrid Spectrogram and Waveform Source Separation☆23Updated 2 years ago
- ☆51Updated last year
- A fast python library for aligning similar audio snippets passed in as NumPy arrays☆48Updated last month
- music semantic understanding evaluation benchmark☆25Updated 2 years ago
- Code implementation for the paper titled MusicLIME: Explainable Multimodal Music Understanding☆23Updated 10 months ago
- Real-time end-to-end singing voice convertion☆22Updated last year
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆15Updated 6 months ago
- ☆14Updated 9 months ago
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆50Updated 8 months ago
- Frechet Audio Distance evaluation in PyTorch☆36Updated 2 years ago
- Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""☆14Updated last year
- Official implementation of "AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and sta…☆45Updated last month
- Landing Page for Divide and Remaster v3☆23Updated 4 months ago
- SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning☆49Updated 4 months ago
- ☆32Updated 8 months ago
- ☆20Updated 9 months ago
- ☆13Updated 3 years ago
- ☆27Updated last year
- ☆54Updated last year
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆32Updated 2 years ago
- ☆11Updated last year
- ☆11Updated last year
- Official repository of Myna: Masking-Based Contrastive Learning of Musical Representations☆15Updated 8 months ago
- AudioSR-Upsampling (any -> 48kHz)☆42Updated last year