satvik-dixit / maceLinks
Code for the paper: MACE: Leveraging Audio for Evaluating Audio Captioning Systems
☆12Updated 9 months ago
Alternatives and similar repositories for mace
Users that are interested in mace are comparing it to the libraries listed below
Sorting:
- ☆69Updated last year
- ☆51Updated 11 months ago
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆19Updated this week
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆15Updated 5 months ago
- ☆52Updated last year
- Official implementation for FlowSep☆64Updated 9 months ago
- ☆118Updated 8 months ago
- Code implementation for the paper titled MusicLIME: Explainable Multimodal Music Understanding☆23Updated 9 months ago
- ☆30Updated 7 months ago
- Official implementation of "AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and sta…☆43Updated 5 months ago
- ☆11Updated 11 months ago
- Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""☆14Updated last year
- Reimplementation of Bandit for "Remastering Divide and Remaster: A Cinematic Audio Source Separation Dataset with Multilingual Support"☆45Updated 3 months ago
- Audio-to-Audio Schrodinger Bridges is a diffusion-based audio restoration model for bandwidth extension and inpainting.☆109Updated 2 months ago
- AudioSR-Upsampling (any -> 48kHz)☆41Updated last year
- Real-time end-to-end singing voice convertion☆22Updated 11 months ago
- SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning☆48Updated 3 months ago
- ☆23Updated last week
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆31Updated 2 years ago
- ☆26Updated last year
- ☆60Updated 9 months ago
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.☆104Updated 10 months ago
- [ICASSP'24] Investigating Personalization Methods in Text to Music Generation☆41Updated last year
- A fast python library for aligning similar audio snippets passed in as NumPy arrays☆48Updated this week
- [Batching/MultiGPU/DataLoader Implemented] Code for the paper Hybrid Spectrogram and Waveform Source Separation☆22Updated 2 years ago
- Conformer block with Rotary Position Embedding, modified from lucidrains' implement☆16Updated last year
- ☆13Updated 8 months ago
- An official implementation of Style-Talker for Spoken Dialogue Generation☆23Updated 9 months ago
- A list of datasets made available by members of the Aalto Acoustics Lab☆29Updated last year
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆22Updated 9 months ago